A NEW ESSENTIAL DOWNSTREAM COMPONENT OF THE WINGLESS SIGNALLING PATHWAY



The present invention relates to a new essential downstream
component of the wingless signaling pathway. In particular, the
invention relates to nucleotide sequences of the Drosophila
melanogaster daughter of legless (doll) gene, of its encoded
proteins, as well as derivatives, fragments and analogues thereof.
The invention includes vertebrate and invertebrate homologues of
the Doll protein, comprising proteins that contain a stretch of
amino acids with similarity to the Drosophila Doll protein. Methods
for producing the Doll protein, derivatives and analogs, e.g. by
recombinant means, and antibodies to Doll are provided by the
present invention as well. The invention also relates to methods
for performing high throughput screening assays for compounds
modulating Doll function in the Wnt pathway.

Background of the invention 

Wnt genes encode a large family of secreted, cysteine-rich
proteins that play key roles as intercellular signaling molecules
in a wide variety of biological processes (for an extensive
review see Wodarz and Nusse 1998). The first Wnt gene, mouse
wnt-1, was discovered as a proto-oncogene activated by integration
of mouse mammary tumor virus in mammary tumors (Nusse and Varmus
1982). Consequently, the involvement of the Wnt pathway in cancer
has been studied extensively. With the identification of the
Drosophila polarity gene wingless (wg) as a wnt-1 homologue
(Cabrera, Alonso et al. 1987; Perrimon and Mahowald 1987;
Rijsewijk, Schuermann et al. 1987), it became clear that wnt genes
are important developmental regulators. Thus, although at first
glance dissimilar, biological processes like embryogenesis and
carcinogenesis both rely on cell communication via identical
signaling pathways. In a current model of the pathway, the
secreted Wnt protein binds to Frizzled cell surface receptors and
activates the cytoplasmic protein Dishevelled (Dsh). Dsh then
transmits the signal to a complex of several proteins, including
the protein kinase Shaggy/GSK3, the scaffold protein Axin and
β-Catenin, the vertebrate homologue of Armadillo. In this complex
β-Catenin is targeted for degradation after being phosphorylated
by Sgg. After Wnt signaling and the resulting down-regulation of
Sgg activity, β-Catenin (or its Drosophila homologue Armadillo)
escapes from degradation and accumulates in the cytoplasm. Free
cytoplasmic β-Catenin translocates to the nucleus by a still
obscure mechanism, and modulates gene transcription through
binding the Tcf/Lef family of transcription factors (Grosschedl R
1999).

This set up, in which the key transducer is continuously held in
check, is highly susceptible to mutations in its inhibitory
components. The loss of any of the three elements of the β-Catenin
destruction complex leads to an increase in β-Catenin levels,
and hence to the constitutive activation of the pathway. While
this may reduce cellular viability, as upon loss of GSK-3
function, it can also lead to cell fate changes, uncontrolled
proliferation and tumorigenic behavior as in the cases of APC and
Axin (Barker N 1999; Morin 1999; Potter 1999; Roose and Clevers
1999; Waltzer and Bienz 1999). Attempts to counter these harmful
situations must aim at curbing the nuclear activities of
β-Catenin, either by preventing the formation of the β-Catenin-TCF
complex or by interfering with its transcriptional activator
function.

Currently, there are no known therapeutic agents effectively
inhibiting β-Catenin transcriptional activation. This is partly
due to the fact that many of the essential components required
for its full activation and nuclear translocation are still
unknown. Consequently, there is an urgent need to understand more
about this pathway in order to be able to develop effective drugs
against these highly malignant diseases.

In order to identify new components required for Wingless
activation the inventors used a Drosophila genetic approach to
screen for dominant suppressors of the rough eye phenotype
caused by ectopic expression of Wingless, the Drosophila
homologue of Wnt, during eye development. Three genes were
identified: the β-catenin homologue armadillo (arm), the tcf/lef-1
homologue pangolin (pan) and legless (lgs), a completely new
gene (US 09/915.543). The lgs gene was subsequently cloned and
its in vivo requirement for Wingless signal transduction in the
embryo and in developing tissues was confirmed. The presence of
Lgs is required for a transcriptionally active Arm/Pangolin
complex and over-expression of Lgs strongly stimulates the
transcriptional output of this bipartite transcription factor. The
human genome contains at least two human Lgs homologues. One of
them, Bcl9, has been previously implicated in B cell
malignancies (Willis, Zalcberg et al. 1998). It was also genetically and
biochemically demonstrated that dLgs and hLgs bind to Armadillo
and β-Catenin and are functionally required for Wnt signal
propagation in human cells. However, genetic experiments
strongly suggested the presence of a second protein which binds
to Lgs and is essential for the function of the active
β-Catenin-Pangolin-Lgs complex.

The present invention describes the cloning and functional
characterization of a novel Drosophila protein, named Daughter of
Legless (Doll), which binds to Lgs and is required for Wnt
signaling. In addition, the invention provides the sequences of the
functional and structural human and mouse homologues as well as 
methods to screen for compounds inhibiting Doll function in the 
Wnt pathway. 

Definitions 

The terms "Doll polypeptide" and "Doll protein", when used herein,
encompass native invertebrate and vertebrate Doll and Doll
variant sequences (which are further defined herein).

A "wild type sequence Doll" comprises a polypeptide having the 
same amino acid sequence as a Doll protein derived from nature. 
Such wild type sequence of Doll can be isolated from nature or 
produced by recombinant and/or synthetic means. The term "wild 
type sequence Doll" specifically encompasses naturally occurring 
truncated forms, naturally occurring variant forms (e.g., alter- 
natively spliced forms) and naturally occurring allelic variants 
of Doll. In one embodiment of the invention, the wild type Doll
sequence is a mature or full-length Doll sequence comprising
amino acids 1 to 815 of dDoll (Figure 1), or 1 to 419 of hDoll-1,
or 1 to 406 of hDoll-2 (Figure 2), or 1 to 417 of mDoll-1, or
1 to 407 of mDoll-2 (Figure 3).

"Doll variant" means an active Doll having at least about 50%
amino acid sequence identity with the amino acid sequence of a
wild type Doll protein of Figures 1, 2 and 3. The term "Doll
variant", however, also includes functional homologues of
Doll in the Wnt pathway.

"Percent (%) amino acid sequence identity'' with respect to the 
Doll sequences identified herein is defined as the percentage of 
amino acid residues in a candidate sequence that are identical 
with the amino acid residues in the Doll sequence, after align- 
ing the sequence and introducing gaps, if necessary, to achieve 
the maximum percentage sequence identity, and not considering 
any conservative amino acid substitution as part of the sequence 
identity. The % identity values used herein can be generated by
WU-BLAST-2, which was obtained from (Tatusova TA 1999).
WU-BLAST-2 uses several search parameters, most of which are set to
the default values.
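
By way of illustration only, the identity calculation defined above
can be sketched as follows. This is a minimal sketch and not the
WU-BLAST-2 implementation. It assumes that the two sequences have
already been aligned (gaps written as "-") and that the denominator
is the number of residues in the longer of the two sequences. The
function name percent_identity and the toy sequences are
hypothetical and are not taken from the Figures.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity of two pre-aligned sequences ('-' marks a gap).

    Assumption: the denominator is the residue count of the longer
    (ungapped) sequence. Conservative substitutions are not counted.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")

    # Count alignment columns in which both residues are identical.
    matches = sum(
        1 for x, y in zip(aligned_a, aligned_b) if x == y and x != "-"
    )

    # Residue counts without gap characters.
    length_a = len(aligned_a.replace("-", ""))
    length_b = len(aligned_b.replace("-", ""))
    longer = max(length_a, length_b)
    if longer == 0:
        raise ValueError("sequences contain no residues")

    return 100.0 * matches / longer


# Toy example (hypothetical sequences, not Doll sequences).
print(percent_identity("MKT-LLVA", "MKTALIVA"))
```

In this toy alignment six of the eight residues of the longer
sequence are identical, so the function returns 75.0.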

The term "positive", in the context of sequence comparison
performed as described above, includes residues in the sequence
compared that are not identical but have similar properties
(e.g. as a result of a conservative substitution). The % value
of positive is determined by the fraction of residues scoring a
positive value in the BLOSUM62 matrix divided by the total
number of residues in the longer sequence as defined above.
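
The % value of positive can be sketched in the same hedged manner.
The sketch below assumes pre-aligned sequences, counts identical
residues as positive (every diagonal entry of BLOSUM62 is positive),
and takes the substitution scores as a parameter. The dictionary
BLOSUM62_EXCERPT holds only a handful of BLOSUM62 entries for the
purpose of the example; pairs missing from it are treated as not
positive, which is a simplification and not a property of the full
matrix. All names are hypothetical.

```python
# A few positive off-diagonal BLOSUM62 scores, for illustration only.
BLOSUM62_EXCERPT = {
    ("I", "V"): 3,
    ("I", "L"): 2,
    ("L", "V"): 1,
    ("D", "E"): 2,
    ("K", "R"): 2,
    ("S", "T"): 1,
    ("F", "Y"): 3,
}


def percent_positive(aligned_a: str, aligned_b: str, scores: dict) -> float:
    """Percent of positions scoring > 0, relative to the longer sequence."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")

    positives = 0
    for x, y in zip(aligned_a, aligned_b):
        if "-" in (x, y):
            continue  # a gap column never scores as positive
        if x == y:
            positives += 1  # identical residues score positive
        elif scores.get((x, y), scores.get((y, x), 0)) > 0:
            positives += 1  # conservative substitution

    longer = max(
        len(aligned_a.replace("-", "")), len(aligned_b.replace("-", ""))
    )
    if longer == 0:
        raise ValueError("sequences contain no residues")
    return 100.0 * positives / longer


print(percent_positive("MKT-LLVA", "MKTALIVA", BLOSUM62_EXCERPT))
```

For the toy alignment, the six identities and the single L/I
substitution give seven positive positions out of eight, i.e. 87.5.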

In a similar manner, "percent (%) nucleic acid sequence iden- 
tity" with respect to the coding sequence of the Doll polypep- 
tides identified herein is defined as the percentage of nucleo- 
tide residues in a candidate sequence that are identical with 
the nucleotide residues in any of the Doll coding sequences of 
this invention. The identity values used herein can be generated 
using the BLAST module of WU-BLAST-2 set to the default parameters.

The term "epitope tag" refers to a chimeric polypeptide
comprising a Doll polypeptide fused to a "tag polypeptide". The tag
polypeptide has enough residues to provide an epitope against 
which an antibody can be made, yet is short enough that it does 
not interfere with the activity of the Doll polypeptide to which 
it is fused. 

A nucleic acid is "operably linked" when it is placed in a
functional relationship with another nucleic acid sequence.

The term "epistasis" means hierarchy in gene action. Epistasis
experiments are performed to place components of a signaling
pathway in the right order.

"Rescue experiments" are designed to determine which
gene is responsible for a specific mutant phenotype. Specifically,
mutant embryos are injected with coding or genomic DNA,
and the effect of the introduced DNA is determined on the basis 
of the capacity to revert the mutant phenotype. 

"Active" or "activity" refers to forms of Doll polypeptides that
retain the biological and/or immunological activity. A preferred 
activity includes for instance the ability to modulate the Wnt 
signaling pathway. 

The term "antagonist" is used in a broad sense, and includes any 
molecule that partially or fully inhibits, blocks or neutralizes 
a biological activity of Doll polypeptides described herein. In 
a similar way, the term "agonist" is used in the broadest sense
and includes any molecule that mimics or supports a biological
activity of an active Doll polypeptide.

"Treatment" refers to both therapeutic treatments and prophylactic
or preventive measures, wherein the objective is to prevent
or slow down the targeted pathologic condition or disorder.
Those in need of treatment include those already with the
disorder as well as those prone to have the disorder or those in whom
the disorder is to be prevented.

Summary of the invention 

The present invention relates to a novel family of proteins pre- 
sent in insects and vertebrate organisms, referred to hereinaf- 
ter as "Daughter of Legless (Doll)" proteins. These proteins 
play an essential role in the Wnt signaling pathway, and thus in 
the formation and maintenance of spatial arrangements and
proliferation of tissues during development, and in the formation
and growth of many human tumors.

In particular, the invention relates to nucleotide sequences of
the Drosophila melanogaster doll gene, of proteins encoded by
said nucleotide sequences, as well as fragments, derivatives and
structural and functional analogs thereof.

In a preferred embodiment the invention relates to the nucleo- 
tide and protein sequences of the human and mouse doll homo- 
logues, hdoll-1, hdoll-2 and mdoll-1 and mdoll-2, respectively. 

In one embodiment, the isolated nucleic acid comprises a
sequence encoding a polypeptide having at least 50% amino acid
sequence identity, preferably at least about 70% sequence
identity, more preferably at least 90% sequence identity, even more
preferably at least about 95% sequence identity, yet even more
preferably at least about 98% sequence identity, and most
preferably 100% identity to (a) a fragment or the entire protein
sequence of the Doll polypeptide shown in Figure 1, or (b) the
complement of the nucleic acid molecule coding for (a).

In another preferred embodiment, the isolated nucleic acid
encodes a polypeptide having at least 50% amino acid sequence
identity, preferably about 70% sequence identity, more
preferably at least 90% sequence identity, even more preferably about
95% sequence identity, yet even more preferably about 98%
sequence identity, and most preferably 100% identity to (a) a
polypeptide which is part of, or the entirety of, one of the human
Doll polypeptides of Figure 2a/b, or (b) the complement of the
nucleic acid molecule coding for (a).

In another embodiment, the isolated nucleic acid encodes a
polypeptide sequence having at least 50% amino acid sequence
identity, preferably about 70% sequence identity, more preferably at
least 90% sequence identity, even more preferably about 95%
sequence identity, yet even more preferably about 98% sequence
identity, and most preferably 100% identity to (a) a polypeptide
which is part of, or the entirety of, the mouse Doll protein of
Figure 3a/b, or (b) the complement of the nucleic acid molecule
coding for (a).

In a further embodiment, the isolated nucleic acid comprises a
sequence encoding a polypeptide that has a low overall amino acid
sequence identity but shows a sequence identity of at least 60%,
preferably at least 70%, more preferably at least 80%, even more
preferably at least 90% and most preferably 100% in the
conserved domains DHD and PHD (Figure 4).

In yet another embodiment of the present invention isolated
nucleic acids encode polypeptides having a function resembling
that of the doll genes.

In another embodiment, the invention relates to a fragment of
the Drosophila or vertebrate doll nucleic acid sequences that is
used as a hybridization probe. Such nucleic acid fragments are
about 20 to about 100 nucleotides in length, preferably from
about 20 to about 60 nucleotides in length, most preferably from
20 to 50 nucleotides in length, and are derived from the
nucleotide sequences shown in Figures 1, 2 and 3.
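
Purely as an illustration of the length limits stated above,
candidate probe fragments could be enumerated from a nucleotide
sequence as sketched below. The tiling strategy, the function name
tile_probes, the step parameter and the placeholder sequence are
assumptions made for this example. The sketch does not reproduce any
probe design of the invention and ignores criteria such as melting
temperature or specificity.

```python
def tile_probes(sequence: str, length: int = 50, step: int = 10) -> list[str]:
    """Return fragments of a fixed length taken every `step` nucleotides.

    The permitted length range (20 to 100 nt) follows the text;
    everything else is an illustrative assumption.
    """
    if not 20 <= length <= 100:
        raise ValueError("probe length must be between 20 and 100 nt")
    if step < 1:
        raise ValueError("step must be at least 1")

    seq = sequence.upper()
    # The range is empty when the sequence is shorter than one probe.
    return [seq[i:i + length] for i in range(0, len(seq) - length + 1, step)]


# Placeholder 60 nt sequence (hypothetical, not a doll sequence).
placeholder = "ACGT" * 15
for probe in tile_probes(placeholder, length=20, step=20):
    print(len(probe), probe)
```

With the 60 nt placeholder, a probe length of 20 and a step of 20,
three non-overlapping fragments are obtained.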

The invention further provides eucaryotic and procaryotic
expression vectors comprising a nucleic acid molecule encoding
Drosophila or vertebrate doll or a fragment thereof as shown in
Figures 1, 2 and 3. The vector can comprise any of the molecules
or fragments thereof described above.

The invention also includes host cells comprising such a vector. 
By way of example, the host cells can be mammalian cells, yeast 
cells, insect cells, plant cells or bacterial cells.

Methods of production, isolation and purification of the Doll 
proteins, derivatives and analogs, e.g. by recombinant means, 
are also provided (see Example VI, below). In a specific aspect,
the invention concerns an isolated Doll peptide, comprising an 
amino acid sequence of at least 80%, preferably at least about 
85% sequence identity, more preferably at least 90% sequence 
identity, even more preferably at least 95% sequence identity, 
yet most preferably 100% identity with the amino acid sequences
of Figures 1, 2 and 3. 

In yet another embodiment the invention relates to chimeric
proteins comprising a Doll polypeptide fused to a heterologous
polypeptide or amino acid sequence. An example of such a chimeric
molecule comprises a Doll polypeptide fused to an epitope tag
sequence, glutathione-S-transferase protein or to a protein with
an enzymatic activity, such as beta-galactosidase or alkaline
phosphatase as described in Example VI below.

In a further aspect the invention concerns an isolated full 
length Doll polypeptide (prepared as described in Example VI),
comprising the amino acid sequences of Figure 1, 2 and 3, or any 
Doll polypeptide or a fragment thereof described in this inven- 
tion sufficient to provide a binding site for an anti-Doll anti- 
body. 

In another embodiment the invention provides antibodies that 
specifically recognize Doll polypeptides. The antibodies can be 
a polyclonal or a monoclonal preparation or fragments thereof. 
Polyclonal antibodies are prepared by immunization of rabbits 
with purified Doll polypeptides prepared as described in Example
VI.

The invention also relates to transgenic animals, e.g.
Drosophila, mice, rats, chickens, frogs, pigs or sheep, having a
transgene, e.g., animals that include and preferably express a 
heterologous form of the Doll genes described herein, or that 
misexpress an endogenous or transgenic doll gene. Such a trans- 
genic animal can serve as a model for studying diseases with 
disrupted Wnt signaling pathway, for the production of Doll pro- 
teins, or for drug screening. 

In yet another embodiment, the invention also features animals, 
e.g. Drosophila, mice, rats, chickens, frogs, pigs or sheep,
having a mutation in the doll gene, e.g. deletions, point
mutations, foreign DNA insertions or inversions. Such animals can
serve to study diseases characterized by disrupted Wnt function 
or in drug screening. 

In addition, the invention relates to the use of Doll proteins, 
homologues, derivatives and fragments thereof as well as nucleic 
acids, derivatives and fragments thereof in therapeutic and di- 
agnostic methods and compounds. In particular, the invention 
provides methods and compounds for treatment of disorders of 
cell fate, differentiation or proliferation by administration of 
a therapeutic compound of the invention. Such therapeutic
compounds include: Drosophila and vertebrate Doll protein
homologues or fragments thereof, antibodies or antibody fragments
thereto, doll antisense DNA or RNA, doll double stranded RNA,
and any chemical or naturally occurring compound interfering with
Doll function, synthesis or degradation. In particular, the
invention provides methods to screen for chemical compounds,
organic products or peptides interfering with Doll function in the
Wnt pathway. In a preferred embodiment the screening method will 
be a cellular reporter gene assay or a protein-protein interac- 
tion assay. 

In another embodiment, a screening assay based on
protein-protein interaction is used to screen for compounds specifically
inhibiting the interaction of Doll with Lgs or with a further
interaction partner X.

The invention also provides methods to screen for chemical com- 
pounds, organic products or peptides interfering with Doll func- 
tion in the Wnt pathway. 

Furthermore, the invention comprises the use of the DHD domain 
in screening assays such as an in vitro protein-protein interac- 
tion assay or a protein-protein interaction in a host cell. Said 
assays are applied for the identification of chemical compounds, 
organic products, polypeptides or peptides interfering with Doll 
function in the Wnt pathway. 

In a preferred embodiment, a therapeutic product according to 
the invention is administered to treat a cancerous condition or 
to prevent progression from a pre-neoplastic or non-malignant
condition to a neoplastic or malignant state. 

In other specific embodiments, a therapeutic product of the in- 
vention is administered to treat a blood disease or to promote 
tissue regeneration and repair. Finally, disorders of cell fate,
especially hyperproliferative or hypoproliferative disorders,
involving aberrant or undesirable expression, or localization, 
or activity of the Doll protein can be diagnosed by detecting 
such levels.

Figure 1 The Drosophila doll cDNA and protein sequence.

Figure 2 The human doll-1 and doll-2 cDNA and protein sequence. 

Figure 3 The mouse doll-1 and doll-2 cDNA and protein
sequences.

Figure 4. Drosophila and human Doll proteins contain a PHD fin- 
ger motif with which they bind to the HD1 of Lgs/BCL9 

(A) Top: Schematic representation of Doll, human DOLL-1 and
human DOLL-2. The two domains that show high sequence similarities
are highlighted in dark gray (DHD: Doll homology domain) and red
(PHD: plant homology domain). The DHD appears to be unique, as
the inventors failed to find a similar sequence in other
Drosophila or human proteins. GenBank accession numbers for Doll,
hDoll-1, hDoll-2 are AF457206, AF457207, AF457208, respectively.
Bottom: Multiple alignment of Drosophila, human and mouse Doll
protein sequences.

(B) Alignment of amino acid sequences of DHD and PHD in Doll and 
its human homologues.

Similarities are boxed, identities shaded in gray. The numbers 
to the left indicate the positions of DHD and PHD within their 
respective protein sequences. For the DHD alignment a gap of 22 
aa has been introduced in the Drosophila DHD (represented as 
(X)5).

(C) Mapping of the Lgs/BCL9 interaction site in Doll. Schematic 
representation of the proteins tested in the yeast-two-hybrid 
assay for their interactions with Lgs and BCL9. Results are
indicated to the right ("bdg").

(D) Mapping of the Doll interaction site in dLgs and hLgs/BCL9.
Schematic representation of the proteins tested in the
yeast-two-hybrid assay for their interactions with Lgs. The two
proteins shown at the bottom were tested by a pull-down assay for
both dLgs (numbers without brackets) and hLgs/BCL9 (numbers in
parentheses) with the same result ("bdg"). The deletion removing
HD1 comprises aa 318-345 for Lgs and aa 177-204 for hLgs/BCL9.
Fusion proteins used were S-Tag-dDoll (aa 542-815) and
GST-hDOLL-2 (aa 301-406).

Figure 5. doll is a segment polarity gene required for Wg
signalling

(A-C) Cuticle preparations of larvae derived from wild-type (A),
wg mutant (B), and doll mutant embryos (C). The doll130/doll130
embryo in (C) is derived from a homozygous doll130 mutant germ
line clone (see Experimental Procedures) and displays a wg-like
phenotype.

(D, E) doll functions downstream of dAPC2. Two cuticle
preparations are shown from larvae that developed in the absence of
maternal and zygotic wild-type dAPC2 function (McCartney et al.,
1999). The embryo in (E) additionally lacks the maternal and
zygotic function of doll (see Examples). In contrast to dAPC
single mutant animals, which have strongly reduced denticle belts,
double mutants display a doll-like phenotype.

(F-I) Confocal images of third instar wing imaginal disc
preparations stained with antibodies against Dll (F,G) and Ptc
(H,I). Wild-type animals show normal expression of these genes
(F,H). Discs derived from doll130 mutant larvae are small, yet
express Ptc (I), but fail to express Dll (G). Lack of Dll
expression may be an indirect consequence of the earlier
wing-to-notum transformation in doll130 larvae. However, we also see a
strong reduction of Dll expression in doll130 mutant cells from
mosaic animals (not shown).

Figure 6. Lgs and the PHD finger of Doll serve to assemble Doll 
and Arm 

Schematic representation of Lgs (yellow) and Doll (light green)
constructs that were used in transgene assays to assess their
ability to rescue lgs or doll mutant animals. 1: Full-length Lgs
(pOP216, aa 1-1464). 2: C-terminally truncated Lgs (pTK131, aa
1-583). 3: HD1-Galll-HD2 (pTK153, aa 268-395 (HD1), aa 369-500
(Galll), aa 465-596 (HD2)). 4: HD1-(HA)3-HD2 (pTK143, aa 268-395
(HD1), aa 465-596 (HD2)). 5: Full-length Doll (pTK56, aa 1-815).
6: Doll[ΔPHD]-HD2 (pTK135, aa 1-740 (Doll[ΔPHD]), aa 483-561
(HD2)). Transgenes 1 to 4 are able to rescue lgs20F homozygotes.
An example for an adult animal rescued by transgene 3 is shown on
the right. Transgene 5 can rescue doll130 homozygotes. Transgene 6
can rescue doll130 as well as lgs20F homozygotes (photographs on
the right).

Figure 7: Rescue of ddoll-/- flies by expression of a human doll
transgene. The lethality caused by the doll130/EP(3)1076
genotype can be fully rescued by a tubulin 1 promoter-driven
transgene that contains either the coding region of the Drosophila
doll gene (not shown) or that of one of its two human homologues
hdoll-1 and hdoll-2.

Figure 8: Effects of human Doll 1 and 2 on Tcf transcription. 

293 cells were transiently transfected with the pTOPFLASH or
pFOPFLASH luciferase reporters and different effector plasmids
as indicated. A constitutively active form of β-Catenin
(ΔN-β-Catenin, 50 ng) or human Doll-1 or hDoll-2 (350 ng) activates
the pTOPFLASH reporter. Cotransfection of human Doll with
ΔN-β-Catenin strongly enhances the response.

Detailed description of the invention 

The Wnt signaling cascade is essential for the development of
both invertebrates and vertebrates, and has been implicated in
tumorigenesis. The Drosophila wg gene is one of the best
characterized within the Wnt-protein family, which includes more
than a hundred genes. In the Drosophila embryo, Wg is required for
formation of parasegment boundaries and for maintenance of
engrailed (en) expression in adjacent cells. The epidermis of
embryos defective in wg function shows only a rudimentary
segmentation, which is reflected in an abnormal cuticle pattern. While
the ventral cuticle of wild type larvae displays denticle belts
alternating with naked regions, the cuticle of wg mutant larvae
is completely covered with denticles. During imaginal disc
development, wg controls dorso-ventral positional information. In
the leg disc, wg patterns the future leg by the induction of
ventral fate (Struhl and Basler 1993). In animals with reduced wg
activity, the ventral half of the leg develops into a mirror
image of the dorsal side (Baker 1988). Accordingly, reduced wg
activity leads to the transformation of wing to notal tissue,
hence the name of the gene (Sharma and Chopra 1976). In the eye
disc, wg suppresses ommatidial differentiation in favor of head
cuticle development, and is involved in establishing the
dorso-ventral axis across the eye field (Heberlein, Borod et al.
1998).

Additional genes have been implicated in the secretion,
reception or interpretation of the Wg signal. For instance,
genetic studies in Drosophila revealed the involvement of frizzled
(Dfz), dishevelled (dsh), shaggy/zeste-white-3 (sgg/zw-3),
armadillo (arm), adenomatous polyposis coli (E-apc), axin, and
pangolin (pan) in Wg signaling. The genetic order of these
transducers has been established, in which Wg acts through Dsh to
inhibit Sgg, thus relieving the repression of Arm by Sgg,
resulting in the cytoplasmic accumulation of Arm and its translocation
to the nucleus. In the nucleus Arm interacts with Pan to
activate transcription of target genes. Vertebrate homologues have
been identified for all these components (for an updated review
see Peifer and Polakis 2000), suggesting that newly identified
members of the Drosophila signaling pathway are likely to have
vertebrate counterparts.

Mutations leading to nuclear accumulation of the mammalian
homologue of Arm, β-Catenin, and consequently to constitutive
activation of the Wnt pathway have been observed in many types of
cancer, including colon cancer, breast cancer, melanoma,
hepatocellular carcinoma, ovarian cancer, endometrial cancer,
medulloblastoma, pilomatricomas, and prostate cancer (Morin 1999;
Polakis, Hart et al. 1999). It is now apparent that deregulation
of β-Catenin signaling is an important event in the genesis of
these malignancies. However, there are still no known
therapeutic agents effectively inhibiting β-Catenin transcriptional
activation. This is partly due to the fact that many of the
essential components required for its full activation and nuclear
translocation are still unknown.

In order to identify new components required for Wingless
activation the inventors used a Drosophila genetic approach to
screen for dominant suppressors of the rough eye phenotype
caused by ectopic expression of Wingless (Wg), the Drosophila
homologue of Wnt, during eye development. A new gene, legless
(lgs, US 09/915.543), was identified as a strong dominant
suppressor of the rough eye phenotype. The gene was subsequently cloned
and its in vivo requirement for Wg signal transduction in the embryo
and in developing tissues was confirmed. The human genome
contains at least two human Lgs homologues, hLgs/Bcl9 and hLgs-1.
dLgs and hLgs bind to Armadillo and β-Catenin and are
functionally required for Wnt signal propagation in invertebrate and
vertebrate cells (US 09/915.543). In particular, the presence of
Lgs is required for a transcriptionally active Arm/Pangolin
complex and over-expression of Lgs strongly stimulates the
transcriptional output.

The inventors later made the interesting observation that a
mutant form of Lgs protein from which the β-Catenin interacting
domain was deleted exhibited a strong dominant-negative effect
on Wg-dependent patterning processes when expressed from a
transgene in wild-type larvae (data not shown). This strongly
suggested that Lgs normally interacts not only with Arm, but
also with at least one additional component. In an effort to
identify such components, yeast-two-hybrid screens for
interacting proteins were carried out. In two independent screens in
which either the entire protein or the N-terminal half of Lgs
was used as a bait, a novel PHD finger protein, referred to as
Daughter-of-Legless (Doll), was identified as an Lgs-binding
protein (Figure 4). The 815 amino acid residue Doll protein carries
a C-terminal domain of 60 amino acids (Figure 4a), which shows
extensive homologies to the PHD (plant homology domain) finger,
also known as LAP (leukemia associated protein) domain (Aasland,
Gibson et al. 1995). This domain comprises a cysteine-rich
Zn-binding motif that has been associated with proteins involved
in chromatin-mediated regulation of transcription. The PHD
finger of Doll is necessary and sufficient to mediate the
interaction with Lgs (Figure 4c-d). The inventors also demonstrate herein
that this interaction is essential for Doll function.

The region of Lgs responsible for Doll-binding was mapped to the
HD1 sequence (Figure 4d). Moreover, two human homologues of the
Drosophila doll gene were identified and isolated (Figure 4a).
The protein products of both human genes, hDOLL-1 and hDOLL-2,
as well as their mouse homologues possess a highly conserved PHD
finger which interacts with the HD1 of hLgs/BCL9 (Figure 4d).
The only other domain in Drosophila Doll, hDoll-1/hDoll-2 and
mDoll-1/mDoll-2 that shows significant sequence homology is a 50
amino acid stretch in the N-terminal region, which is referred
to herein as "Doll homology domain" (DHD, Figure 4a, b).

The interaction with Doll appears to be relevant for the in vivo
function of Lgs, since a mutant form of Lgs with a deletion of
HD1 was unable to rescue lgs mutant animals. The physical
association of Doll and Lgs suggested that Doll, like Lgs, may be
required for Wg signaling in vivo. To explore this hypothesis a
proprietary collection of suppressors of the sev-wg phenotype
was searched for mutations that map to the tip of the right arm
of chromosome 3, the position of the doll gene. One such
suppressor, Sup130, mapped to this position, and intriguingly, it
showed dominant lethality in combination with the lgs allele
lgs17E (US 09/915.543) (Sup130/+ lgs17E/+ transheterozygous
animals do not survive). The doll coding region was sequenced using
genomic DNA from homozygous Sup130 mutant larvae and a 14 bp
deletion starting at amino acid position 751 was identified. Hence
this allele is referred to as doll130 and encodes a truncated
Doll protein lacking the C-terminal PHD finger.

The lethality caused by the homozygous doll^130 genotype can be
fully rescued by a tubulin α1 promoter-driven transgene that
contains either the coding region of the Drosophila doll gene or
that of one of its two human homologues, hDoll-1 and hDoll-2
(Figure 7). Thus, the vertebrate homologues of doll were
confirmed genetically to be true functional homologues of Doll,
and hence the vertebrate homologues are part of this invention.

To assay the possible role of Doll in Wg signal transduction
during development, embryos homozygous for the doll^130 mutation
that derived from female germ cells equally mutant for doll were
generated. Doll mRNA is maternally contributed and strongly and
ubiquitously expressed during all developmental stages.
Consequently, only embryos lacking both zygotic and maternal doll
show a severe segment polarity phenotype (Figure 5A-C), while
weaker loss-of-function doll mutants display pupal lethality
with a partial or complete loss of the antennae and the legs.
Mutant individuals lacking only zygotic function survive until
early pupal stages and exhibit imaginal discs that are
abnormally small. The Hh target gene ptc was expressed at
wild-type levels in these discs; however, no expression of the
Wg target Dll could be detected (Figure 5F-I). These discs
appear to lack the presumptive wing blade field and possess two
primordia for the notum (Figures 5G and I). The fact that
similar phenotypes are caused by loss of function of wg, dsh,
arm, or lgs confirms the essential role of doll in the Wg
signaling pathway.

To address the role of Doll in β-Catenin-mediated transcription,
a TCF reporter gene (TOPFLASH, (Morin, Sparks et al. 1997)) was
used in immortalized human embryonic kidney cells (HEK 293
cells). Low levels of a stable mutant form of β-Catenin
(ΔN-β-catenin; (van de Wetering, Cavallo et al. 1997)) were
introduced into these cells to partially stimulate the pathway.
The additional expression of hDoll-1 (Figure 8) or hDoll-2 (not
shown) led to a large increase in luciferase activity (30-fold).
These levels are significantly higher than the sum of those
produced by either treatment alone (Figure 8). This potentiation
of β-Catenin activity by hDoll-1 and hDoll-2 appears to be
mediated by the interaction of endogenous TCF protein with its
DNA target sites, as it is only observed with TOPFLASH, which
contains five optimal TCF binding sites, but not with the
control reporter FOPFLASH, which contains five mutated sites
(Morin, Sparks et al. 1997). Thus this experiment adds
supportive evidence to the notion that Doll proteins transduce
Wnt signals by activating TCF target genes in a
β-Catenin-dependent manner.
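
The comparison "higher than the sum of those produced by either
treatment alone" can be illustrated with a minimal Python sketch.
The function names and all readings below are hypothetical
placeholders introduced for illustration; they are not data from
Figure 8.

```python
def fold_over(baseline, value):
    """Fold activation relative to the reporter-only baseline."""
    if baseline <= 0:
        raise ValueError("baseline must be positive")
    return value / baseline


def is_more_than_additive(baseline, a_alone, b_alone, combined):
    """Return True if the combined response exceeds the additive expectation.

    All arguments are normalized reporter activities (arbitrary units).
    The additive expectation is the baseline plus the increment that each
    treatment produces on its own.
    """
    increment_a = a_alone - baseline
    increment_b = b_alone - baseline
    additive_expectation = baseline + increment_a + increment_b
    return combined > additive_expectation


# Hypothetical normalized readings (assumed values, not measured data).
reporter_only = 1.0
catenin_alone = 3.0
doll_alone = 1.5
both = 30.0

print(fold_over(reporter_only, both))
print(is_more_than_additive(reporter_only, catenin_alone, doll_alone, both))
```

A combined value that only matches the additive expectation would
argue against potentiation, which is why the check uses a strict
inequality.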

In summary, the protein-protein interactions demonstrated
between Drosophila Doll and Lgs and those between their human
homologues, human Doll and hLgs/Bcl9, respectively, in
conjunction with the genetic and cell biological data show that
Doll proteins are positive regulators of the Wg and Wnt
signaling pathways, respectively.

EXAMPLES

Example I: Isolation of doll cDNA 

The cDNA for Daughter of Legless (Doll) was isolated in two
independent yeast genetic screens of a Drosophila cDNA library
for proteins directly binding to Lgs. Other DNA libraries can be
used as well, such as genomic and cDNA libraries from vertebrate
and invertebrate organisms. Methods other than a yeast two-hybrid
screen can be used as well. Such methods include, but are not
limited to, direct amplification using gene-specific primers and
standard methods known to people skilled in the art. To perform
the yeast two-hybrid screening for proteins binding to Lgs, cDNA
sequences encoding the first 732 amino acids ("LgsN") and the
full-length protein of 1464 amino acids ("LgsFL") were subcloned
into a yeast expression vector (pLexA, Clontech), fusing them to
the LexA DNA-binding domain. Subsequently these constructs were
transformed into the LEU2-reporter yeast strain EGY48 together
with the lacZ-reporter plasmid pSH18-34 and an embryonic
Drosophila melanogaster cDNA library fused to an acidic
transcriptional activation domain ("RFLY-1" library, PNAS 93,
3011-3015). In a first step, triple-transformant colonies
containing the LgsN- or LgsFL-LexA fusion constructs,
respectively, the pSH18-34 reporter and a RFLY-1 library plasmid
were grown on minimal selective medium plates for two days,
harvested, thoroughly mixed, and stored as uniform aliquots.
Then cells from
one of these aliquots were transferred into permissive
galactose/raffinose minimal selective liquid medium and incubated
with shaking at 30°C for a few hours, thereby inducing expression
of the library cDNA-activation domain fusion from the
GAL2-inducible promoter. Finally these "induced" cells were
plated on galactose/raffinose minimal selective medium plates
lacking the amino acid L-leucine. On these plates cell growth was
sustained only upon activation of a LEU2 selector gene through
molecular interaction of the respective LexA fusion and
activation domain fusion proteins. The LEU2 gene codes for an
essential metabolic enzyme needed for the biosynthesis of leucine
from other amino acid precursors. All clones growing under these
restrictive conditions were isolated and analyzed for the
activity of the lacZ reporter gene, encoding the metabolic enzyme
β-galactosidase from the enterobacterium E. coli, by a standard
X-Gal assay (e.g. Bartel and Fields (eds.), Oxford University
Press 1997). From
all candidate clones that passed these two selection steps, the
cDNA library plasmids were isolated again by standard techniques
(e.g. Methods in Yeast Genetics, Cold Spring Harbor Laboratory
Press, 1997) and retested for specific interaction with Lgs in
the X-Gal assay, using an unrelated LexA fusion protein as a
negative control. By this procedure three independent cDNA
clones were identified that strongly and specifically
interacted only with Lgs and contained partially overlapping
sequences: BK12b, BK14b and TK5.35h. By searching the Drosophila
genome database using the blastn algorithm
(http://www.ncbi.nlm.nih.gov:80/BLAST/) we mapped the three
isolated cDNAs to the CG11518 locus, coding for a protein product
of 815 amino acids in length. The cDNA clones coded for different
parts of the Doll protein, with BK12b containing nucleotides
(nt) 2223-2448, BK14b nt 2191-2448 and TK4.35h nt 749-2448 of
the computationally predicted open reading frame (ORF). Further
bioinformatic analysis (http://www.ebi.ac.uk/interpro/)
revealed that the very C-terminal part of the protein sequence
(ca. aa 745-805), present in all three of the Lgs-binding
clones, was predicted to adopt a PHD-finger fold, which has been
identified in other proteins involved in transcriptional
regulation at different levels.
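
The statement that all three clones contain the predicted PHD
finger can be checked with a short Python sketch. It assumes that
the nucleotide positions are 1-based, inclusive, and counted from
the first base of the start codon of the predicted ORF; the
function names are hypothetical and introduced only for
illustration.

```python
def nt_range_to_aa_range(nt_start, nt_end):
    """Map a 1-based, inclusive ORF nucleotide range to the codons it touches."""
    if nt_start < 1 or nt_end < nt_start:
        raise ValueError("invalid nucleotide range")
    first_codon = (nt_start - 1) // 3 + 1
    last_codon = (nt_end - 1) // 3 + 1
    return first_codon, last_codon


def covers(aa_range, domain):
    """True if aa_range fully contains the domain (both inclusive)."""
    return aa_range[0] <= domain[0] and aa_range[1] >= domain[1]


# Clone coordinates as stated in the text (nt of the predicted ORF).
CLONES = {
    "BK12b": (2223, 2448),
    "BK14b": (2191, 2448),
    "TK4.35h": (749, 2448),
}
PHD_FINGER = (745, 805)  # approximate amino acid positions from the text

for name, (start, end) in CLONES.items():
    aa = nt_range_to_aa_range(start, end)
    print(name, aa, covers(aa, PHD_FINGER))
```

Under these assumptions nt 2448 falls in codon 816, which is
consistent with an ORF of 815 codons followed by a stop codon.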

Example II: Identification of Human and Mouse Homologues of
Drosophila Doll

After the identification of the Drosophila Doll amino acid
sequence, publicly available databases were searched for similar
protein sequences in other species, using the tblastn algorithm
(http://www.ncbi.nlm.nih.gov:80/BLAST/). Two candidate sequences
each were found in ESTs from Mus musculus and Homo sapiens,
respectively, the putative protein products of which display
high similarity to Doll in their C-terminal domains. These
stretches of high similarity are predicted to adopt a PHD-finger
fold as well (Figure 4). Doll proteins do not display other known
structural motifs in their N-terminal sequences, but they
display a second high-homology domain, which was accordingly
named the DOLL Homology Domain (DHD) (Figure 4). Doll proteins of
both invertebrate and vertebrate origin have so far not been
further described or experimentally studied, and have thus not
previously been implicated in any specific biological process.

Example III: Isolation and mapping of Drosophila doll alleles 

EMS-treated males were crossed to females carrying a wg
transgene (sev-wg) driven by two copies of the sevenless enhancer
(Basler, Christen et al. 1991). 2 x 10^5 progeny were screened
for suppressors of the rough eye phenotype. Third chromosomal
suppressors were coarsely mapped by meiotic recombination using
a panel of P[y+] insertions. One such suppressor, Sup^130,
intriguingly showed dominant lethality in combination with the
lgs allele lgs^17E (US09/915.543) (Sup^130/+ lgs^17E/+
transheterozygous animals do not survive), strongly suggesting a
close genetic interaction. Fine mapping of the mutation using
denaturing HPLC (WAVE system, Transgenomic Inc.) demonstrated
that it localizes within the doll gene. The doll coding region
was therefore sequenced using PCR fragments covering the doll
coding region derived from genomic DNA from homozygous Sup^130
mutant larvae. The defect in Sup^130 was found to be a 14 bp
deletion (nucleotides 2253 to 2266: 5' CATGTGCCACAAGG 3') within
the doll open reading frame that induced a frameshift subsequent
to amino acid 751 and resulted in the formation of a premature
stop codon. Hence this allele is referred to as doll^130; it
encodes a truncated Doll protein lacking the C-terminal PHD
finger.
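
Why this deletion shifts the reading frame can be made explicit
with a small Python sketch. It assumes 1-based nucleotide
numbering from the first base of the start codon; the helper
names are hypothetical.

```python
DELETED_SEQUENCE = "CATGTGCCACAAGG"  # deleted bases as given in the text
DEL_START, DEL_END = 2253, 2266      # 1-based, inclusive ORF positions


def deletion_length(start, end):
    """Number of bases removed by an inclusive deletion."""
    return end - start + 1


def causes_frameshift(length):
    """A deletion shifts the frame unless its length is a multiple of 3."""
    return length % 3 != 0


def codon_of(nt):
    """Codon number (1-based) that contains ORF nucleotide nt."""
    return (nt - 1) // 3 + 1


length = deletion_length(DEL_START, DEL_END)
print(length == len(DELETED_SEQUENCE))  # coordinates agree with the sequence
print(causes_frameshift(length))        # 14 is not divisible by 3
print(codon_of(DEL_START))              # codon where the deletion begins
```

Under the stated numbering assumption the deletion begins in
codon 751, which matches the position given for the frameshift.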

Pole cell transplantation, chromosome squashes, and chromosome 
in situ hybridization experiments were carried out according to 
standard protocols (Ashburner 1989).



Example IV: Use of doll as a hybridization probe

The following method describes the use of a non-repetitive
nucleotide sequence of doll as a hybridization probe. The method
can be applied to screen for doll homologues in other organisms
as well. DNA comprising the sequence of doll (as shown in
Figures 1, 2, 3) is employed as a probe to screen for homologous
DNAs (such as those included in cDNA or genomic libraries).
Hybridization and washing of the filters containing either
library DNA is performed under standard high stringency
conditions (Sambrook, Fritsch et al. 1989). Positive clones can
be used to further screen larger cDNA library platings.
Representative cDNA clones are subsequently cloned into
pBluescript (Stratagene) or similar cloning vectors and
sequenced.

Example V: Use of doll as a hybridization probe for in situ
hybridization

In situ hybridization of Drosophila doll mRNA can be performed
in embryos as described in (Tautz and Pfeifle 1989). With small
modifications, the method can also be used to detect any mRNA
transcript in Drosophila larval imaginal discs or vertebrate
tissue sections. Labeled RNA probes can be prepared from
linearized doll cDNA (as shown in Figures 1, 2, 3), or a fragment
thereof, using the DIG RNA Labeling Kit (SP6/T7) (Boehringer
Mannheim) following the manufacturer's recommendations.

Example VI: Expression of doll in Drosophila melanogaster

Doll can be expressed in Drosophila in the whole organism, in a
specific organ or in a specific cell type, during the whole life
or only at a specific developmental stage, and at different
levels. An overview of the standard methods used in Drosophila
genetics can be found in (Brand and Perrimon 1993; Perrimon 1998;
Perrimon 1998).

Generation of doll mutant embryos 

Mosaic germlines are generated with the help of site-specific
recombination through the FLP recombinase (Xu and Rubin 1993).
Females of the genotype hsp70:flp, FRT82 doll^130 / FRT82 ubi-GFP
are heat-shocked at 37°C for 1 hr during the third instar larval
stage to induce FLP-directed recombination and later mated to
doll^130 / TM6b[y+] males. Germline mosaics are thereby induced.
The source of recombinase is a first chromosome insertion of a
fusion of the hsp70 promoter (denoted by "hsp70") to the FLP
coding sequence. Somatic recombination at the FRT82 sites gives
rise to an adult female germ line that produces oocytes that upon
fertilization lead to embryos which contain neither zygotic nor
maternally contributed information for the production of
functional dDoll protein. Those embryos can be identified by the
absence of the yellow+ phenotype provided by the TM6b[y+]
paternal balancer chromosome. For analysis, cuticles are
prepared by standard techniques from mutant embryos and examined
by dark field microscopy.

dAPC2 doll double mutant germ line clones were generated with an
FRT82 dAPC2 DS doll^130 chromosome. The FRT82 ovo m chromosome
(Chou and Perrimon 1996) was used to select for mutant germ
cells. The FRT82 doll^130 chromosome was also used to create doll
mutant clones in discs, in conjunction with an FRT82 arm-lacZ
chromosome.

Generation of doll mutant embryos expressing constitutively
active Arm

In order to express constitutively active Arm ("ΔArm"), females
of the genotype described above are heat-shocked at 37°C for 1
hr during late pupal stages and mated to males of the genotype
UAS:ΔArm hsp70-Gal4 / UAS:ΔArm hsp70-Gal4 ; doll / TM6b[y+]. Due
to the presence of the additional transgenes in these males,
offspring arising from a doll mutant oocyte and a doll mutant
sperm express, upon heat treatment, the constitutively active
Arm protein, which transiently induces Wingless target genes.

Example VII: Rescue of ddoll-/- flies with hdoll-1 and hdoll-2
cDNA expression

In order to confirm the functional homology between Drosophila
Doll and human Doll-1 and human Doll-2, the human genes were
introduced into Drosophila flies carrying two mutant doll
alleles (ddoll-/- flies). Specifically, flies carrying e.g. a
tub:hdoll transgene and two mutant doll alleles, e.g. doll^130
and EP(3)1076 (publicly available), were generated. ddoll-/-
mutant flies display larval or pupal lethality. In contrast,
ddoll-/- mutant flies carrying at least one copy of the
tub:hdoll-1 or tub:hdoll-2 transgene survive to adulthood. This
demonstrates that both hDoll-1 and hDoll-2 can replace
endogenous dDoll function in flies and thus validates functional
homology between Drosophila and human Doll (Figure 7).

Example VIII: Protein production and purification of Doll in E.
coli

The following method describes recombinant expression of Doll
proteins in bacterial cells. DNA encoding full-length or a
truncated Doll form is fused e.g. downstream of an epitope tag or
glutathione-S-transferase (GST) cDNA and a thrombin or
enterokinase cleavage site contained within an inducible
bacterial expression vector. Such epitope tags include poly-his,
S-protein, thioredoxin and immunoglobulin tags. A variety of
plasmids can be employed, including commercially available
plasmids such as pGEX-4T (Pharmacia) or pET-32a (Novagen).

Briefly, a bacterial expression plasmid containing the doll
sequence, for instance fused to a GST sequence, is transformed by
conventional methods into protease-deficient E. coli such as BL21
(Stratagene). A bacterial colony containing the plasmid is then
expanded overnight in selection medium to reach saturation. The
next morning, this culture is diluted 1:100 and the bacteria are
allowed to grow to an optical density (OD600) of 0.6. Protein
production is initiated by addition of an inducer of the
promoter under which the GST-Doll fusion protein is expressed. A
variety of inducers can be employed depending on the expression
vector used, including IPTG.

Expressed GST-tagged Doll can then be purified, for instance,
using affinity beads or affinity chromatography, such as
glutathione beads (commercially available from Pharmacia).
Extracts are prepared by lysing the Doll-expressing bacteria in
sonication buffer (10 mM Tris HCl pH 8.0, 150 mM NaCl, 1 mM EDTA,
1.5% sarkosyl, 2% Triton-X-100, 1 mM DTT and protease
inhibitors), followed by short sonication on ice (e.g. 3 times
20 seconds at middle power) and centrifugation. Cleared
supernatants are then incubated under gentle rotation, for
example with glutathione beads, for 1 hr at 4°C. Next, the beads
are washed several times in washing buffer (20 mM Tris pH 8.0,
200 mM NaCl, 1 mM EDTA, 1 mM DTT, 1 mM MgCl2, 0.5% NP40), and
finally stored in storage buffer (20 mM Tris pH 8.0, 200 mM NaCl,
1 mM EDTA, 1 mM DTT, 1 mM MgCl2, 10% glycerol, 0.5% NP40, and
proteinase inhibitors).
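
Preparing such a buffer from concentrated stocks is a simple
dilution calculation (final concentration divided by stock
concentration, times the total volume). The Python sketch below
works this out for the washing buffer. The stock concentrations
and the 100 ml batch size are assumptions chosen for
illustration; they are not specified in the text.

```python
# Washing buffer components: (final concentration, assumed stock concentration).
# Both values of a pair use the same unit (mM or percent), so the ratio is unitless.
WASH_BUFFER = {
    "Tris pH 8.0": (20, 1000),  # mM; assumed 1 M stock
    "NaCl": (200, 5000),        # mM; assumed 5 M stock
    "EDTA": (1, 500),           # mM; assumed 0.5 M stock
    "DTT": (1, 1000),           # mM; assumed 1 M stock
    "MgCl2": (1, 1000),         # mM; assumed 1 M stock
    "NP40": (0.5, 10),          # percent; assumed 10% stock
}


def stock_volumes(recipe, total_ml):
    """Return the volume (ml) of each stock, plus water, for total_ml of buffer."""
    volumes = {}
    for name, (final, stock) in recipe.items():
        if final > stock:
            raise ValueError(f"stock of {name} is too dilute")
        volumes[name] = total_ml * final / stock
    volumes["water"] = total_ml - sum(volumes.values())
    return volumes


for component, ml in stock_volumes(WASH_BUFFER, 100).items():
    print(f"{component}: {ml:.2f} ml")
```

The storage buffer differs only by the added glycerol and
proteinase inhibitors, so the same function could be reused with
an extended recipe.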

Alternatively, His-, S-protein-, thioredoxin- or IgG-tagged Doll
can be purified using affinity chromatography.

The quality of the preparations can be checked e.g. by SDS-gel 
electrophoresis and silver staining or Western blot. 

In case the epitope tag has to be cleaved, several methods are
available depending on the presence of a cleavage site between
the epitope tag and the Doll protein. For example, it is
possible to produce a GST-Doll fusion protein containing a
thrombin cleavage site right before the first Doll amino acid.
Briefly, a GST-Doll preparation on glutathione-affinity beads is
washed several times in cleavage buffer (50 mM Tris HCl pH 7.0,
150 mM NaCl, 1 mM EDTA, 1 mM DTT). Thrombin is then added and the
samples are incubated for over 16 h at room temperature.
Supernatants are then collected and analyzed for successful
cleavage of Doll from the beads by polyacrylamide gel
electrophoresis and silver staining or Western blot.



Example IX: Protein-protein interactions involving Doll 

A GST fusion protein in vitro binding assay can be performed to
map binding domains and find additional interaction partners.
For this purpose, proteins are in vitro translated using
reticulocyte lysates (e.g. TNT lysates, Promega Corporation)
containing [35S]methionine following the instructions provided
by the manufacturer. Alternatively, cellular proteins can be
labeled by incubation of cultured cells with [35S]methionine.
Glutathione S-transferase (GST) fusion proteins, produced as
illustrated in Example VIII, are immobilized on
glutathione-Sepharose and blocked in binding buffer (20 mM Tris
pH 8.0, 200 mM NaCl, 1 mM EDTA, 1 mM DTT, 1 mM MgCl2, 10%
glycerol, 0.5% NP40, 0.05% BSA, and proteinase inhibitors) for
45 min. Two µg of immobilized GST proteins are then incubated for
1.5 hrs with 0.5-6 µl of in vitro translated proteins in binding
buffer or with [35S]methionine-labeled cell extract. The beads
are washed four times in washing buffer (20 mM Tris pH 8.0, 200
mM NaCl, 1 mM EDTA, 1 mM DTT, 1 mM MgCl2, 0.5% NP40) and boiled
in Laemmli SDS sample buffer. Proteins binding to Doll are
detected by autoradiography. If a cell lysate is used to
identify novel Doll binding partners, the protein bands on the
gel can be isolated by methods known in the art, and the protein
sequence can be determined e.g. by mass spectrometric analysis.

A yeast two-hybrid assay can additionally be performed to
confirm the results of the in vitro binding assays described
above or to screen a cDNA library for new interaction partners
(Fields and Sternglanz 1994). In this context, the desired cDNAs
are subcloned into appropriate yeast expression vectors that
link them either to a LexA DNA binding domain (e.g. pLexA,
Clontech) or an acidic activation domain (e.g. pJG4-5,
Clontech). The appropriate pair of plasmids is then transformed
together with a reporter plasmid (e.g. pSH18-34, Clontech) into
an appropriate yeast strain (e.g. EGY48) by the lithium
acetate-polyethylene glycol method and grown on selective media
(Sambrook, Fritsch et al. 1989). Transformants are analyzed for
reporter gene activity as described by the manufacturer of the
vector-reporter plasmid used. To establish reproducibility, the
interaction is tested in both directions. Alternatively, this
method is used to screen for novel Doll interaction partners. In
this context, e.g. pLexA-Doll is transformed into yeast together
with a cDNA library cloned into e.g. pJG4-5 as described above.
Positive clones can be isolated and the cDNA they contain can be
sequenced by methods known to people skilled in the art.

Example X: Immunohistochemistry

Localization of the Doll proteins is performed on Drosophila
embryos, imaginal discs, invertebrate and vertebrate adult tissue
sections or tumor cell lines using the anti-Doll antibodies
provided by this invention. For instance, if a tumor cell line is
used, cells can be seeded into polylysine-coated 8-well chambers
(Nalge-Nunc Internat.) and grown overnight at 37°C. As a
positive control, HEK 293 cells (ATCC) might be transfected e.g.
by a lipofection method (e.g. Lipofectamine, Gibco technologies)
with a Doll expression plasmid, such as pcDNA3.1 (Invitrogen).
Two days after transfection, cells are washed and fixed with
3.7% formaldehyde in PBS for 10 min, permeabilized in 0.5%
Triton-X-100 for another 10 min, and blocked with a 1:1000
dilution of pre-immune serum in 2% BSA-PBS for 1 h at RT. Cells
are then incubated with a 1:1000 dilution of anti-Doll
immunoserum for 2 hrs at RT, followed by washing in PBS and
staining with anti-rabbit secondary antibody. The washing step
is repeated and preparations are blocked in a solution of 3% BSA
in PBS/0.1% Triton-X-100 for 1 hr. The slides are then washed
three times for 5 min in PBS and incubated with a 1:200 dilution
(v/v) of TRITC-conjugated swine anti-rabbit immunoglobulin
(Dako, Inc.). The washing step is repeated before applying
coverslips using Vectashield® mounting medium (Vector
Laboratories, Inc.). Detection of other proteins such as
β-Catenin, hLgs or Tcf can be performed in the same way using
anti-β-Catenin (commercially available), anti-hLgs (US
09/915.543) or anti-Tcf (commercially available) specific
antibodies, respectively.



Example XI: Luciferase reporter gene assays

The effect of Doll on Tcf transactivation activity can be
assessed in a cell culture system using a Tcf-responsive
luciferase reporter gene. Depending on the expression vector
used, this protocol can be applied to mammalian as well as
Drosophila cell lines. For instance, HEK293 cells (ATCC) are a
well-suited system. Here, Doll full-length cDNA is cloned into a
mammalian expression vector, such as pcDNA3 (Invitrogen), and
transfected together with the TOPFLASH luciferase reporter
plasmid (Upstate Biotechnology, New York, USA) into 293 cells. A
lipofection agent like the Lipofectamine transfection reagent
(Life Technologies, Inc.) can be used for this purpose. A
renilla luciferase reporter plasmid, e.g. pRL-SV40 (Promega
Corporation, Madison, USA), is co-transfected to normalize for
transfection efficiency. Cell extracts are prepared 48 h after
transfection and assayed for firefly and renilla luciferase
activity as described by the manufacturer (Dual Luciferase
Reporter Assay System, Promega Corporation). All the luciferase
values are normalized for renilla luciferase activity (see
Figure 8).
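
The normalization step can be sketched in Python as follows. Each
well's firefly reading is divided by its renilla reading, the
ratios of replicate wells are averaged, and the result is
expressed relative to a control condition. The function names and
the readings are hypothetical; the use of replicate wells and of
a simple mean are assumptions, not details given in the protocol.

```python
from statistics import mean


def normalized_activity(wells):
    """Mean firefly/renilla ratio over replicate wells.

    `wells` is a list of (firefly, renilla) raw luminescence readings.
    """
    ratios = []
    for firefly, renilla in wells:
        if renilla <= 0:
            raise ValueError("renilla reading must be positive")
        ratios.append(firefly / renilla)
    return mean(ratios)


def fold_activation(sample_wells, control_wells):
    """Normalized activity of a sample relative to the control condition."""
    return normalized_activity(sample_wells) / normalized_activity(control_wells)


# Hypothetical raw readings (firefly, renilla) for duplicate wells.
control = [(1000.0, 500.0), (1200.0, 600.0)]
sample = [(9000.0, 450.0), (11000.0, 550.0)]

print(fold_activation(sample, control))
```

Dividing by the renilla signal within each well, rather than
after averaging, keeps a single poorly transfected well from
distorting the mean.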

Example XII: Screening of chemical compounds, organic products
or peptides interfering with Doll function

A reporter gene assay is performed with a protocol similar to
that described in Example XI, but scaled down to be performed as
a high throughput screen. For this purpose colon cancer cell
lines with mutated and/or constitutively active β-Catenin are
stably transfected with the TOPFLASH vector described in Example
XI and Doll cDNA. The established monoclonal population which
gives the most reliable and constant reporter gene activity is
selected for later assays. One day after plating, cells are
treated with single compounds derived from a chemical or peptide
library. One to 24 hours later reporter gene activity is
measured. Compounds found to inhibit reporter gene activity are
then further characterized for specific activity on the
Doll-containing transcriptional complex. Alternatively, Wnt
pathway activity can be measured by detecting mRNA or protein
levels of a target gene, e.g. myc (He, Sparks et al. 1998).
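
One way to flag inhibitory compounds in such a screen is to
express each reading as percent inhibition relative to
vehicle-treated wells. The Python sketch below is a minimal
illustration; the background control, the 50% threshold, the
compound names and all readings are assumptions and are not part
of the protocol described above.

```python
def percent_inhibition(signal, vehicle_mean, background_mean):
    """Percent reduction of reporter signal relative to vehicle-treated wells.

    background_mean is the signal of a no-activity control (an assumed
    control, e.g. wells without reporter activity).
    """
    window = vehicle_mean - background_mean
    if window <= 0:
        raise ValueError("vehicle signal must exceed background")
    return 100.0 * (vehicle_mean - signal) / window


def call_hits(readings, vehicle_mean, background_mean, threshold=50.0):
    """Return the sorted names of compounds at or above the threshold."""
    return sorted(
        name
        for name, signal in readings.items()
        if percent_inhibition(signal, vehicle_mean, background_mean) >= threshold
    )


# Hypothetical plate readings (arbitrary luminescence units).
readings = {"cmpd_A": 200.0, "cmpd_B": 950.0, "cmpd_C": 550.0}
print(call_hits(readings, vehicle_mean=1000.0, background_mean=100.0))
```

A reduced signal can also reflect general toxicity, so compounds
flagged this way would still need the specificity tests mentioned
in the text.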

Example XIII: Screening assay based on protein-protein
interaction for compounds inhibiting Doll-Lgs or Doll
interaction partner X

Doll and its interaction partner or fragments thereof are
produced and purified e.g. from E. coli cultures (e.g. as
described in Example VIII). Proteins are tagged e.g. with 6
histidines, S-protein, GST or thioredoxin. Small aliquots of the
purified proteins are incubated in an appropriate binding
buffer. At this point chemical compounds are added to the mixture
and their capacity to disrupt the protein-protein interaction is
monitored e.g. by any of the methods described below. Compounds
inhibiting this interaction are subsequently tested for their
specificity and in vivo toxicity. Well established methods to
monitor protein-protein interactions are e.g.:

• Time resolved fluorometry with lanthanide chelate labels
(Hemmila I. and Webb S. DDT 2: 373-381 (1997))

• Scintillation proximity assay (SPA) (Amersham Life Science)

• Fluorescence polarisation

Claims

1. A nucleic acid sequence coding for a polypeptide being
part of at least one signaling pathway in insects and
vertebrates, characterized in that said nucleotide sequence is
the "Daughter of Legless" (DOLL) gene as well as homologues,
fragments, derivatives and functional and structural analogs
thereof.

2. The nucleic acid sequence according to claim 1, charac- 
terized in that said signaling pathway is the Wnt sig- 
naling pathway. 

3. The nucleic acid sequence according to claim 2,
characterized in that it is the Drosophila melanogaster doll
gene (ddoll) comprising the nucleotide sequence as shown in
SEQ. ID. NO 1.

4. The nucleic acid sequence according to claim 3, charac- 
terized in that it is coding for a polypeptide compris- 
ing 50% to 100%, preferably 100% sequence identity to a 
fragment or the entire sequence of the Drosophila Doll 
polypeptide as shown in SEQ. ID. NO 6. 

5. The nucleic acid sequence according to claim 2,
characterized in that it is the human doll-1 gene (hdoll-1)
comprising the nucleotide sequence as shown in SEQ ID. NO 2.

6. The nucleic acid sequence according to claim 5, charac- 
terized in that it is coding for a polypeptide compris- 
ing 50% to 100%, preferably 100% sequence identity to a 
fragment or the entire sequence of the human Doll 
(hDoll-1) polypeptide as shown in SEQ. ID. NO 7.

7. The nucleic acid sequence according to claim 2,
characterized in that it is the human doll-2 gene (hdoll-2)
having the nucleotide sequence as shown in SEQ ID. NO 3.

8. The nucleic acid sequence according to claim 7,
characterized in that it is coding for a polypeptide comprising
50% to 100%, preferably 100% sequence identity to a fragment or
the entire sequence of the human Doll polypeptide (hDoll-2) as
shown in SEQ. ID. NO 8.

9. The nucleic acid sequence according to claim 2, charac- 
terized in that it is the mouse doll-1 gene (mdoll-1) 
comprising the nucleotide sequence as shown in SEQ ID. 
NO 4.

10. The nucleic acid sequence according to claim 9, charac- 
terized in that it is coding for a polypeptide compris- 
ing 50% to 100%, preferably 100% sequence identity to a 
fragment or the entire sequence of the mouse Doll 
(mDoll-1) polypeptide as shown in SEQ. ID. NO 9. 

11. The nucleic acid sequence according to claim 2,
characterized in that it is the mouse doll-2 gene (mdoll-2)
comprising the nucleotide sequence as shown in SEQ ID.
NO 5.

12. The nucleic acid sequence according to claim 11,
characterized in that it is coding for a polypeptide comprising
50% to 100%, preferably 100% sequence identity to a fragment or
the entire sequence of the mouse Doll polypeptide (mDoll-2) as
shown in SEQ. ID. NO 10.

13. A polypeptide being part of at least one signaling pathway
in insects and vertebrates, characterized in that said
polypeptide is the "Daughter of Legless" (DOLL) protein as well
as homologues, fragments, derivatives and structural and
functional analogs thereof.

14. The polypeptide according to claim 13, characterized in 
that said signaling pathway is the Wnt signaling path- 
way. 

15. The polypeptide according to claim 14, characterized in
that said polypeptide is the Drosophila melanogaster 
Doll protein having the amino acid sequence as shown in 
SEQ. ID. NO. 6. 

16. The polypeptide according to claim 15, characterized in 
that said polypeptide has a 50% to 100%, preferably 100% 
sequence identity to a fragment or the entire sequence 
of the Doll polypeptide as shown in SEQ. ID. NO 6.

17. The polypeptide according to claim 14, characterized in
that said polypeptide is the human Doll protein (hDoll-1)
having the amino acid sequence as shown in SEQ. ID. NO. 7.

18. The polypeptide according to claim 17, characterized in 
that said polypeptide has a 50% to 100%, preferably 100% 
sequence identity to a fragment or the entire sequence 
of the Doll polypeptide as shown in SEQ. ID. NO 7. 

19. The polypeptide according to claim 14, characterized in
that said polypeptide is the human Doll protein (hDoll-2)
having the amino acid sequence as shown in SEQ. ID. NO. 8.



20. The polypeptide according to claim 19, characterized in 
that said polypeptide has a 50% to 100%, preferably 100%
sequence identity to a fragment or the entire sequence
of the Doll polypeptide as shown in SEQ. ID. NO 8. 

21. The polypeptide according to claim 14, characterized in
that said polypeptide is the mouse Doll-1 protein (mDoll-1)
having the amino acid sequence as shown in SEQ. ID. NO. 9.

22. The polypeptide according to claim 21, characterized in
that said polypeptide has a 50% to 100%, preferably 100% 
sequence identity to a fragment or the entire sequence of 
the Doll polypeptide as shown in SEQ. ID. NO 9. 

23. The polypeptide according to claim 14, characterized in
that said polypeptide is the mouse Doll-2 protein (mDoll-2)
having the amino acid sequence as shown in SEQ. ID. NO. 10.

24. The polypeptide according to claim 23, characterized in 
that said polypeptide has a 50% to 100%, preferably 100% 
sequence identity to a fragment or the entire sequence of 
the Doll polypeptide as shown in SEQ. ID. NO 10. 

25. The nucleic acid sequence according to claim 2, charac- 
terized in that said nucleic acid sequence is coding for 
a polypeptide comprising a low overall amino acid se- 
quence identity and a sequence identity of 50% to 100%, 
preferably 100% in conserved domains. 

26. The nucleic acid sequence according to claim 25, charac- 
terized in that said conserved domain is the DHD domain. 

27. The nucleic acid sequence according to claim 26, charac- 
terized in that said conserved domain is the PHD finger. 

28. The nucleic acid sequence according to claim 2, characterized 
in that said sequence comprises a fragment of 20 to 100 
nucleotides in length, preferably from 20 to 60 nucleotides, 
and most preferably from 20 to 50 nucleotides, said fragment 
being derived from SEQ. ID. NO. 1, 2, 3, 4 or 5. 

29. The nucleic acid sequence according to claim 28, used as 
a hybridization probe. 

30. A vector comprising a nucleic acid molecule encoding a 
Drosophila or vertebrate doll gene or a fragment thereof, 
selected from the group consisting of SEQ. ID. NO. 1, 2, 
3, 4 or 5. 

31. The vector according to claim 30, selected from the group 
consisting of eucaryotic and procaryotic expression vectors. 



32. A host cell comprising the vector of claim 31, selected 
from the group consisting of mammalian cells, yeast 
cells, plant cells, insect cells or bacterial cells. 

33. A method for the preparation of Doll proteins, fragments, 
derivatives and analogs thereof, comprising the steps of: 

a) isolating a nucleic acid sequence containing the com- 
plete doll cDNA sequence or part thereof; 

b) recombinantly expressing doll cDNA or a fragment 
thereof in bacterial, mammalian, plant, yeast or 
insect cells; 

c) inducing protein production in said cells; 
d) purifying Doll proteins. 

34. The method according to claim 33, wherein step a) comprises 
the isolation of a nucleic acid molecule encompassing the 
doll cDNA from a vertebrate or invertebrate cDNA or genomic 
library. 

35. The method according to claim 33, wherein step b) comprises 
the fusion of DNA encoding full-length or truncated doll to 
an epitope tag and a cleavage site contained within an 
inducible eucaryotic or procaryotic expression vector and 
transforming the appropriate host cells with said expression 
vector. 

36. A chimeric protein comprising a Doll polypeptide fused to 
a heterologous amino acid sequence selected from the group 
consisting of an epitope-tagged sequence, an antibody, 
glutathione-S-transferase protein, β-galactosidase, and 
alkaline phosphatase. 

37. The chimeric protein according to claim 36, characterized 
in that the Doll polypeptide is selected from the group 
consisting of dDoll, hDoll-1, hDoll-2, mDoll-1 and mDoll-2. 

38. The polypeptide according to claim 14, characterized in 
that it comprises the full length Doll polypeptide or a 
fragment thereof comprising an antibody-binding site for 
an anti-Doll antibody. 

39. An antibody specifically recognizing a Doll polypeptide, 
said antibody being selected from the group consisting of 
polyclonal and monoclonal antibodies and fragments thereof. 

40. An assay for studying diseases induced by a disrupted Wnt 
function or for drug screening comprising the use of 
organisms selected from the group consisting of Drosophila, 
mice, rats, rabbits, chicken, frogs, pigs or sheep, said 
organisms showing increased, reduced or no expression of 
doll, or expressing a mutated Doll polypeptide, in at least 
one tissue or organ. 

41. The assay according to claim 40, characterized in that 
said organisms express the doll gene as a heterologous 
transgene. 

41. The assay according to claim 40, characterized in that 
said doll gene comprises a mutation selected from the 
group consisting of deletions, point mutations, foreign 
DNA insertions and inversions. 

42. Use of a Doll protein, homologue, derivative and fragment 
thereof for the development of a therapeutic and diagnostic 
method for the treatment of conditions selected from the 
group consisting of disorders of cell fate, differentiation 
or proliferation. 

43. Use of a Doll protein, homologue, derivative and fragment 
thereof for the development of a therapeutic and diagnostic 
compound for the diagnosis of conditions selected from the 
group consisting of disorders of cell fate, differentiation 
or proliferation. 

44. The use of a Doll protein, homologue, derivative and fragment 
thereof according to claim 43, characterized in that said 
therapeutic or diagnostic compound is selected from the 
group consisting of Drosophila and vertebrate Doll protein 
homologues and fragments thereof and antibodies and antibody 
fragments thereof. 

45. Use of doll nucleic acids, homologues, derivatives and 
fragments thereof for the development of a therapeutic 
and/or diagnostic compound for the treatment of conditions 
selected from the group consisting of disorders of cell 
fate, differentiation or proliferation and its application 
to an individual. 

46. Use of doll nucleic acids, homologues, derivatives and 
fragments thereof for the development of a therapeutic 
and diagnostic method for the treatment of conditions 
selected from the group consisting of disorders of cell 
fate, differentiation or proliferation and its application 
to an individual. 

47. The use of doll nucleic acids, homologues, derivatives and 
fragments thereof according to claim 45, characterized in 
that said therapeutic or diagnostic compound is selected 
from the group consisting of doll antisense DNA or RNA, 
doll double-stranded RNA and chemical or naturally occurring 
compounds interfering with doll function. 

48. A peptide comprising a fragment of the Doll polypeptide. 

49. The peptide according to claim 48, comprising 40 to 60 
amino acids in the N-terminal region of hDoll-1, hDoll-2, 
mDoll-1, mDoll-2 and Drosophila Doll. 

50. The peptide according to claim 49, said peptide comprising 
the Doll homology domain (DHD). 

51. Use of DHD in screening methods for the identification of 
chemical compounds, organic products, polypeptides or peptides 
interfering with Doll function in the Wnt pathway. 

52. The use of DHD according to claim 51 in a screening assay 
based on protein-protein interactions. 

53. The use of DHD according to claim 52, wherein said screening 
assay is selected from the group comprising an in vitro 
protein-protein interaction assay and a protein-protein 
interaction assay in a host cell. 

54. The use of DHD according to claim 51, in a screening assay 
for compounds specifically inhibiting the interaction 
between Doll and another protein. 
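
Claims 18 to 25 recite sequence identities of 50% to 100% without stating, in this section, how identity is to be computed. As a minimal sketch only, assuming identity is counted as the share of matching positions between two already-aligned, equal-length sequences (the function name `percent_identity` and the gap convention are illustrative assumptions, not part of the claims), the following Python compares the conserved C-terminal stretches of hDoll-1 and hDoll-2 copied from the protein sequences of Figure 2:

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percent of aligned positions carrying the same residue.

    Both inputs must already be aligned to equal length. '-' marks a gap
    and never counts as a match (an assumption made for this sketch).
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    if not seq_a:
        raise ValueError("sequences must not be empty")
    matches = sum(
        1
        for a, b in zip(seq_a.upper(), seq_b.upper())
        if a == b and a != "-"
    )
    return 100.0 * matches / len(seq_a)


# C-terminal stretches copied from the hDoll-1 and hDoll-2 protein sequences.
HDOLL1_CTERM = "VYPCGICTNEVNDDQDAILCEASCQKWFHRICTGMTETAYGLLTAEASAVWGCD"
HDOLL2_CTERM = "VYPCGACRSEVNDDQDAILCEASCQKWFHRECTGMTESAYGLLTTEASAVWACD"

if __name__ == "__main__":
    identity = percent_identity(HDOLL1_CTERM, HDOLL2_CTERM)
    print(f"{identity:.1f}% identity over {len(HDOLL1_CTERM)} residues")
```

A gapped alignment produced by a dedicated alignment tool would be needed for sequences of different length; the sketch deliberately leaves that step out.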
Figure 1: The translated sequence of the Drosophila 
doll gene. 

dDoll cDNA Sequence Range: 1 to 2448 

ATGACCCACAATCTTGGTATGGCGCCATATCGATTGCCGGGTCCAGCGGG 

CGGACTCTGTCCGCCCGATTTTAAGCCGCCGCCTCCCACGGACATCATCT 

CGGCGCCGAGCAATCCGAAGAAGCGGCGAAAAACCTCAAGTGCCGCCAAC 

TCCGCTGCAGCGGTGGCTGCGGCGGCGGCTGCAGCAGCTGCTGCGAATTC 

CATGCAGCAGCAGCAGGCGCCACCCACACCGCAGGATTTGCTGCCCCCTC 

CGCCAATGGGAGGCTTCGGAGACACCATTATTGCCTCGAATCCATTCGAC 

GACAGTCCCCAGGTGTCGGCGATGTCCAGCTCAGCGGCCGCGGCGATGGC 

GGCCATGAATCAGATGGGCGGCGGACCAGGAGGTGGTCACTTTGGCGGCG 

GTGGACCGGGTGGGCACCCGCACTGGGAAGACCGCATGGGCATGGGCGGT 

GGACCTCCTCCCCCGCCTCACATGCATCCCCATATGCACCCGCATCATCC 

AGGCGGACCTATGGGTCACCCACATGGCCCACATCCGCACATGGGTGGTC 

CACCTCCAATGCGAGGAATGAGCCCCATGCACCCCCATCAAATGGGACCG 

GGACCAGGCGTCGGACTACCGCCGCATATGAATCACGGAAGGCCAGGGGG 

ACCTGGTGGTCCTGGAGGACCCGTCCCAATGGGTAGTCCCATGGGTGGAA 

TAGCTGGCATGGGCGGCATGAGCCCAATGGGCGGAATGGGAGGCCCCAGC 

ATATCACCCCATCACATGGGCATGGGTGGTCTGTCGCCCATGGGAGGCGG 

TCCCAACGGACCCAATCCGCGAGCCATGCAGGGTTCACCGATGGGCGGTC 

CGGGGCAGAACTCGCCAATGAACTCACTGCCTATGGGTTCGCCAATGGGC 

AATCCAATTGGCAGCCCGTTGGGCCCTCCCTCGGGACCGGGCCCTGGGAA 

TCCCGGCAATACCGGCGGACCACAGCAGCAACAACAACAACCTCCGCAGC 

CACCGATGAACAACGGGCAGATGGGTCCTCCTCCTCTGCACAGTCCGCTC 

GGAAACGGACCAACGGGTCATGGCAGTCACATGCCTGGAGGACCAATCCC 

AGGACCAGGTCCTGGGCCTGGCGGCCTAGTAGGTCCCGGTGGCATCTCCC 

CCGCGCACGGCAATAACCCGGGTGGTTCTGGGAACAACATGCTCGGCGGG 

AATCCCGGCGGCGGCAACAGCAACAACAACGGAAGCAATACAAGTAACGC 

CAGCAACAACAATCAAAATCCTCACCTCTCGCCAGCAGCCGGACGCCTGG 

GAGTGCCGACGTCGATGCAGTCGAATGGACCTTCGGTATCATCGGTAGCC 

TCCTCATCGGTTCCCTCGCCCGCCACGCCCACGCTCACGCCCACATCGAC 

GGCCACGTCCATGTCCACGTCAGTGCCTACATCCTCGCCAGCGCCGCCCG 

CCATGTCACCGCATCACTCGCTAAACAGCGCCGGGCCGAGTCCGGGCATG 

CCCAACTCGGGACCCAGCCCGCTGCAGTCACCAGCCGGACCCAATGGCCC 

CAATAACAACAACAGCAATAACAACAACGGACCCATGATGGGCCAGATGA 

TCCCGAACGCAGTTCCTATGCAGCACCAGCAGCACATGGGCGGCGGCCCA 

CCTGGCCACGGGCCCGGACCAATGCCCGGAATGGGCATGAACCAAATGCT 

GCCACCGCAGCAACCCTCCCATCTTGGTCCCCCGCATCCGAATATGATGA 
ACCACCCGCATCATCCGCACCATCATCCTGGCGGACCACCGCCGCACATG 
ATGGGTGGACCCGGAATGCACGGCGGTCCTGCTGGAATGCCTCCTCATAT 
GGGCGGAGGACCTAATCCGCACATGATGGGCGGTCCGCACGGGAACGCGG 
GTCCGCACATGGGCCACGGCCACATGGGTGGAGTACCAGGTCCAGGACCC 
GGACCCGGCGGCATGAACGGACCCCCGCATCCGCACATGTCCCCGCACCA 
CGGACATCCGCATCACCACCACAATCCGATGGGCGGCCCAGGTCCAAATA 
TGTTCGGCGGTGGTGGAGGAGGTCCCATGGGTCCCGGTGGACCGATGGGC 
AACATGGGGCCCATGGGAGGTGGCCCGATGGGCGGCCCTATGGGCGTAGG 
TCCCAAGCCGATGACAATGGGCGGCGGGAAGATGTACCCGCCGGGACAGC 
CAATGGTCTTTAATCCGCAGAACCCGAATGCGCCGCCCATATATCCTTGT 
GGCATGTGCCACAAGGAGGTGAACGACAACGACGAAGCCGTGTTCTGTGA 
ATCCGGTTGTAACTTTTTCTTTCACAGAACCTGTGTTGGCCTGACAGAGG 
CGGCCTTCCAAATGCTCAACAAGGAGGTGTTTGCCGAGTGGTGCTGCGAC 
AAGTGCGTGTCTTCCAAGCATATTCCCATGGTCAAGTTCAAGTGTTGA 



dDoll Protein sequence 1-815 

MTHNLGMAPYRLPGPAGGLCPPDFKPPPPTDIISAPSNPKKRRKTSSAAN 
SAAAVAAAAAAAAAANSMQQQQAPPTPQDLLPPPPMGGFGDTIIASNPFD 
DSPQVSAMSSSAAAAMAAMNQMGGGPGGGHFGGGGPGGHPHWEDRMGMGG 
GPPPPPHMHPHMHPHHPGGPMGHPHGPHPHMGGPPPMRGMSPMHPHQMGP 
GPGVGLPPHMNHGRPGGPGGPGGPVPMGSPMGGIAGMGGMSPMGGMGGPS 
ISPHHMGMGGLSPMGGGPNGPNPRAMQGSPMGGPGQNSPMNSLPMGSPMG 
NPIGSPLGPPSGPGPGNPGNTGGPQQQQQQPPQPPMNNGQMGPPPLHSPL 
GNGPTGHGSHMPGGPIPGPGPGPGGLVGPGGISPAHGNNPGGSGNNMLGG 
NPGGGNSNNNGSNTSNASNNNQNPHLSPAAGRLGVPTSMQSNGPSVSSVA 
SSSVPSPATPTLTPTSTATSMSTSVPTSSPAPPAMSPHHSLNSAGPSPGM 
PNSGPSPLQSPAGPNGPNNNNSNNNNGPMMGQMIPNAVPMQHQQHMGGGP 
PGHGPGPMPGMGMNQMLPPQQPSHLGPPHPNMMNHPHHPHHHPGGPPPHM 
MGGPGMHGGPAGMPPHMGGGPNPHMMGGPHGNAGPHMGHGHMGGVPGPGP 
GPGGMNGPPHPHMSPHHGHPHHHHNPMGGPGPNMFGGGGGGPMGPGGPMG 
NMGPMGGGPMGGPMGVGPKPMTMGGGKMYPPGQPMVFNPQNPNAPPIYPC 
GMCHKEVNDNDEAVFCESGCNFFFHRTCVGLTEAAFQMLNKEVFAEWCCD 
KCVSSKHIPMVKFKC 
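
The correspondence between the dDoll cDNA and protein sequences of Figure 1 can be checked by translating the open reading frame. The following is a minimal sketch, assuming the standard genetic code and a reading frame that starts at the first base; the name `translate` and the two constants are illustrative and not taken from the specification.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cdna: str) -> str:
    """Translate a cDNA string in frame 1, stopping at the first stop codon.

    Whitespace is ignored; codons with unexpected letters become 'X'.
    """
    seq = "".join(cdna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 15 bases of the dDoll cDNA shown in Figure 1.
DDOLL_START = "ATGACCCACAATCTTGGTATGGCGCCATAT"
DDOLL_END = "AAGTTCAAGTGTTGA"

if __name__ == "__main__":
    print(translate(DDOLL_START))
    print(translate(DDOLL_END))
```

Translating the first 30 bases gives MTHNLGMAPY and the last 15 bases give KFKC followed by the stop codon, which agrees with the first ten and last four residues of the dDoll protein sequence above.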
Figure 2 

hDoll-1 cDNA sequence 

GGATCCCCACATGCCCGCCGAGAACTCTCCAGCTCCCGCTTACAAAGTTT 

CCTCGCATGGTGGTGATAGTGGACTGGATGGGTTAGGAGGACCAGGTGTA 

CAACTAGGAAGCCCAGATAAGAAAAAGCGCAAGGCAAATACACAGGGACC 

TTCTTTCCCTCCATTGTCTGAGTATGCTCCACCACCGAATCCAAACTCTG 

ACCATCTAGTGGCTGCTAATCCATTTGATGACAACTATAATACTATTTCC 

TATAAACCACTACCTTCGTCAAATCCATATCTTGGCCCTGGTTATCCTGG 

CTTTGGAGGCTATAGTACATTCAGAATGCCACCTCACGTTCCCCCAAGAA 

TGTCTTCCCCATACTGTGGTCCTTACTCACTCAGGAACCAGCCACACCCA 

TTTCCTCAGAATCCTCTGGGCATGGGTTTTAATCGACCTCATGCTTTTAA 

CTTTGGGCCACATGATAATTCAAGTTTCGGTAATCCATCTTATAATAATG 

CACTAAGTCAGAATGTCAACATGCCTAATCAACATTTTAGACAAAATCCT 

GCTGAAAATTTCAGTCAAATTCCTCCACAGAATGCTAGCCAAGTTTCTAA 

CCCCGATTTGGCATCTAATTTTGTTCCTGGAAATAATTCAAATTTTACTT 

CTCCGTTAGAATCTAATCATTCTTTTATTCCTCCCCCAAACACTTTTGGT 

CAAGCAAAAGCACCACCCCCAAAACAAGACTTTACTCAAGGAGCAACCAA 

AAACACTAATCAAAATTCCTCTGCTCATCCACCTCACTTGAATATGGATG 

ACACAGTGAATCAGAGTAATATTGAATTAAAAAATGTTAATCGAAACAAT 

GCAGTAAATCAGGAGAACAGCCGTTCAAGTAGCACTGAAGCCACAAACAA 

TAACCCTGCAAATGGGACGCAGAATAAGCCACGACAACCAAGAGGTGCAG 

CAGATGCCTGCACCACAGAAAAAAGCAATAAATCCTCTCTTCACCCAAAC 

CGTCATGGCCATTCGTCTTCTGACCCAGTGTATCCTTGTGGAATTTGTAC 

AAACGAGGTGAACGATGATCAGGATGCCATCTTATGTGAGGCCTCTTGTC 

AGAAATGGTTTCATCGGATCTGTACTGGAATGACTGAAACAGCTTATGGC 

CTCTTAACTGCAGAAGCATCTGCAGTATGGGGCTGTGATACCTGTATGGC 

TGACAAAGATGTCCAGTTAATGCGTACTAGAGAAACTTTTGGTCCATCTG 

CAGTGGGCAGTGATGCTTAATCAAAGGCATTAACTAAAGTGGGTTTATTT 

TCCTGTGCATTGCAGAAGTTCATTGACACAGGATTTTAATGTTTTACATT 

ATTTTTTTAAATGCAT 
hDoll-1 Protein sequence 

MPAENSPAPAYKVSSHGGDSGLDGLGGPGVQLGSPDKKKRKANTQGPSFP 
PLSEYAPPPNPNSDHLVAANPFDDNYNTISYKPLPSSNPYLGPGYPGFGG 
YSTFRMPPHVPPRMSSPYCGPYSLRNQPHPFPQNPLGMGFNRPHAFNFGP 
HDNSSFGNPSYNNALSQNVNMPNQHFRQNPAENFSQIPPQNASQVSNPDL 
ASNFVPGNNSNFTSPLESNHSFIPPPNTFGQAKAPPPKQDFTQGATKNTN 
QNSSAHPPHLNMDDTVNQSNIELKNVNRNNAVNQENSRSSSTEATNNNPA 
NGTQNKPRQPRGAADACTTEKSNKSSLHPNRHGHSSSDPVYPCGICTNEV 
NDDQDAILCEASCQKWFHRICTGMTETAYGLLTAEASAVWGCDTCMADKD 
VQLMRTRETFGPSAVGSDA 

hDoll-2 cDNA sequence 

CCCGGGTCCCCCACTCCATGGCCGCCTCGGCGCCGCCCCCACCGGACAAG 

CTGGAGGGAGGTGGCGGCCCCGCACCGCCCCCTGCGCCGCCCAGCACCGG 

GAGGAAGCAGGGCAAGGCCGGTCTGCAAATGAAGAGTCCAGAAAAGAAGC 

GAAGGAAGTCAAATACTCAGGGCCCTGCATACTCACATCTGACGGAGTTT 

GCACCACCCCCAACTCCCATGGTGGATCACCTGGTTGCATCCAACCCTTT 

TGAAGATGACTTCGGAGCCCCCAAAGTGGGGGTTGCAGCCCCTCCATTCC 

TTGGCAGTCCTGTGCCCTTCGGAGGCTTCCGTGTGCAGGGGGGCATGGCG 

GGCCAGGTACCCCCAGGCTACAGCACTGGAGGTGGAGGGGGCCCCCAGCC 

ACTCCGTCGACAGCCACCCCCCTTCCCTCCCAATCCTATGGGCCCTGCTT 

TCAACATGCCCCCCCAGGGTCCTGGCTACCCACCCCCAGGCAACATGAAC 

TTTCCCAGCCAACCCTTCAACCAGCCTCTGGGTCAAAACTTTAGTCCTCC 

CAGTGGGCAGATGATGCCGGGCCCAGTGGGGGGATTTGGTCCCATGATCT 

CACCCACCATGGGACAGCCTCCCAGAGCAGAGCTGGGCCCACCTTCTCTG 

TCCCAACGATTTGCTCAGCCAGGGGCTCCTTTTGGCCCTTCTCCTCTCCA 

GAGACCTGGTCAGGGGCTCCCCAGCCTGCCGCCTAACACAAGTCCCTTTC 

CTGGTCCGGACCCTGGCTTTCCTGGCCCTGGTGGTGAGGATGGGGGGAAG 

CCCTTGAATCCACCTGCTTCTACTGCTTTTCCCCAGGAGCCCCACTCAGG 

CTCCCCGGCTGCTGCTGTTAATGGGAACCAGCCCAGTTTCCCCCCGAACA 

GCAGTGGGCGGGGTGGGGGCACTCCAGATGCCAACAGCTTGGCACCCCCT 

GGCAAGGCAGGTGGGGGCTCCGGGCCCCAGCCTCCCCCAGGCTTGGTGTA 

CCCATGTGGTGCCTGTCGGAGTGAGGTGAACGATGACCAGGATGCCATTC 

TGTGTGAGGCCTCCTGCCAGAAATGGTTCCACCGTGAGTGCACAGGCATG 

ACTGAGAGCGCCTATGGGCTGCTGACCACTGAAGCTTCTGCCGTCTGGGC 

CTGCGATCTCTGCCTCAAGACCAAGGAGATCCAGTCTGTCTACATCCGTG 

AGGGCATGGGGCAGCTGGTGGCTGCTAACGATGGGTGACGCTGGTGAAGT 

GGCCCAGGGAAGTGCACATGTCTCTCCCTGCTCTTCCAGGGTGATTTTTT 

TGATGTTTGGCTCTTGGTCCTTGTTTCCACTGGCTTTCCATCCCCATGGG 

GCAGAAACAGTGGCTCCTGGGAGCAGAAAAGGAATTGAGGTGGGCAGGCA 

GAAGAGCCTGGATTGCTCACTGTTTTGGGAAACTTACATGTTGAGATCT 



hDoll-2 protein sequence 

MAASAPPPPDKLEGGGGPAPPPAPPSTGRKQGKAGLQMKSPEKKRRKSNT 
QGPAYSHLTEFAPPPTPMVDHLVASNPFEDDFGAPKVGVAAPPFLGSPVP 
FGGFRVQGGMAGQVPPGYSTGGGGGPQPLRRQPPPFPPNPMGPAFNMPPQ 
GPGYPPPGNMNFPSQPFNQPLGQNFSPPSGQMMPGPVGGFGPMISPTMGQ 
PPRAELGPPSLSQRFAQPGAPFGPSPLQRPGQGLPSLPPNTSPFPGPDPG 
FPGPGGEDGGKPLNPPASTAFPQEPHSGSPAAAVNGNQPSFPPNSSGRGG 
GTPDANSLAPPGKAGGGSGPQPPPGLVYPCGACRSEVNDDQDAILCEASC 
QKWFHRECTGMTESAYGLLTTEASAVWACDLCLKTKEIQSVYIREGMGQL 
VAANDG 

Figure 3 

mDoll-1 cDNA sequence 

ATGTCAGCGGAACAGGACAAGGAGCCCATCGCGCTGAAGAGAGTTAGAGG 

TGGTGACAGTGGACTGGATGGGTTAGGAGGGCCCAATATACAACTCGGAA 

GCCCAGATAAGAAAAAACGCAAGGCCAACACACAGGGATCTTCCTTTCCT 

CCGCTGTCAGAGTACGCGCCACCACCGAATCCAAACTCCGACCATCTAGT 

GGCTGCCAATCCGTTCGATGACAGCTACAACACCATTTCCTATAAACCAC 

TGCCTTCATCTAATCCATATCTTGGCCCTGGGTATCCTGGCTTTGGAGGC 

TACAGCACATTCAGAATGCCACCCCACGTCCCTCCAAGAATGTCTTCTCC 

CTACTGTGGTCCTTACTCACTCAGGAATCAGCCCCACCCATTTCCTCAGA 

ATCCGTTGGGCATGGGCTTTAACCGGCCTCATGCTTTTAACTTTGGGCCG 

CATGATAATTCGAATTTTGGAAACCCACCTTATAATAATGTACTGACTCA 

GGACATTAACATGCCCGGTCAGCATTTTAGACAAGGCTCTGCTGAAAACT 

TCAGTCAGATTCCCCCGCAGAATGTTGGCCAAGTGTCTAACCCTGACCTC 

GCATCTAATTTTGCCCCTGGGAATAATTCAAATTTTACCTCTCCGTTAGA 

AACGAATCATTCGTTTATTCCACCCCCAAACGCGTTTGGCCAAGCAAAAG 

CTCCACTTCCCAAACAAGACTTCACTCAAGGGGCAACCAAAACCCCGAAT 

CAGAATTCGTCCACTCACCCACCTCACCTAAATATGGAGGATCCAGTCAA 

TCAGAGTAACGTCGAGTTAAAAAATGTCAACAGAAACAACGTTGTCCAAG 

AGAACAGCCGTTCGGGCAGCGCAGAGGCCACCAACAACCATGCGAATGGG 

ACCCAGAACAAGCCCCGGCAGCCCAGGGGCGCAGCTGACCTGTGCACCCC 

CGACAAAAGCCGCAAGTTCTCCCTGCTCCCCAGCCGGCATGGCCATTCCT 

CCTCTGACCCTGTGTACCCGTGCGGGATTTGTACAAATGAAGTGAATGAC 

GATCAGGACGCCATTCTGTGTGAAGCCTCTTGTCAGAAGTGGTTTCATCG 

CATCTGCACTGGAATGACCGAAACAGCCTACGGGCTCCTGACAGCGGAAG 

CATCCGCAGTGTGGGGCTGTGACACGTGCATGGCTGACAAGGATGTCCAG 

CTCATGCGCACTAGAGAGGCCTTTGGTCCACCTGCCGTGGGCGGCGATGC 

CTAA 



mDoll-1 protein sequence 

MSAEQDKEPIALKRVRGGDSGLDGLGGPNIQLGSPDKKKRKANTQGSSFP 
PLSEYAPPPNPNSDHLVAANPFDDSYNTISYKPLPSSNPYLGPGYPGFGG 
YSTFRMPPHVPPRMSSPYCGPYSLRNQPHPFPQNPLGMGFNRPHAFNFGP 
HDNSNFGNPPYNNVLTQDINMPGQHFRQGSAENFSQIPPQNVGQVSNPDL 
ASNFAPGNNSNFTSPLETNHSFIPPPNAFGQAKAPLPKQDFTQGATKTPN 
QNSSTHPPHLNMEDPVNQSNVELKNVNRNNVVQENSRSGSAEATNNHANG 
TQNKPRQPRGAADLCTPDKSRKFSLLPSRHGHSSSDPVYPCGICTNEVND 
DQDAILCEASCQKWFHRICTGMTETAYGLLTAEASAVWGCDTCMADKDVQ 
LMRTREAFGPPAVGGDA 

mDoll-2 cDNA sequence 

ATGGCCGCCTCGGCGCCGCCCCCACCGGACAAGCTGGAGGGAGGCAGCGG 

CCCCGCACCGCCCCCCGCGCCGCCCAGCACCGGGAGGAAGCAGGGCAAGG 

CCGGTCTGCAAATGAAGAGCCCTGAAAAGAAGCGAAGAAAGTCCAATACT 

CAGGGTCCTGCATATTCACATCTGACGGAGTTCGCCCCACCCCCGACCCC 

CATGGTGGATCATCTGGTCGCTTCTAACCCTTTTGAGGATGATTTCGGAG 

CCCCTAAAGTGGGGGGCGCAGGCCCTCCGTTCCTCGGCAGTCCGGTGCCC 

TTTGGAGGCTTTCGTGTACAGGGGGGCATGGCAGGCCAGGTACCCCCAAG 

CTACGGCACTGGAGGAGGAGGGGGTCCCCAGCCACTTCGTCGGCAGCCCC 

CTCCTTTTCCCCCCAGCCCTATGGGTCCAGCTTTTAATATGCCCCCTCAG 

GGTCCCTGGGGTACCCCGCCCCCTGGCAACATGAACTTTCCCAGTCAACC 

CTTCAACCAGTCTCTGGGCCAAAACTTTAGCCCACCTGGTGGGCAGGTGA 

TGCCAGGCCCAGTAGGCGGATTTGGTCCCATGATCTCACCGACCATGGGA 

CAGCCTCCTAGAGGGGAGCTGGGTCCTCCTCCTCTCCCCCAACGCTTTAC 

CCAACCAGGAGCACCTTATGGTCCTTCTCTTCAGAGACCTGGTCAGGGAC 

TCACCCAGCTGCCCTCCAACACAAGTCCCTTCCCTGGTCCAGACCCTGGT 

TTTCCTGGACCTGGCGGTGAGGATGGTGGGAAGCCCTTGAACCCACCGGC 

TCCCACCGCCTTTCCCCAGGAAGCACCATTCGGGCTCCCCGCTGCTGCTG 

TCAATGGGAATCAGCCCAGTTTCCCCCCTAGCAGCAGTGGTCGAGGTGGG 

GGCACTCCAGATGCCAACAGTCTGGCACCCCCCGGCAAGGCAGGGGGAGG 

CTCAGGGCCCCAGCCTCCCCCAGGCCTGGTGTACCCCTGCGGTGCCTGCC 

GTAGTGAGGTAAATGATGACCAGGATGCCATTCTGTGTGAGGCCTCCTGC 

CAGAAGTGGTTTCACCGCGAGTGCACCGGCATGACCGAGAGTGCCTACGG 

CCTGCTGACCACCGAGGCCTCTGCCGTCTGGGCCTGTGATCTTTGCCTCA 

AGACCAAGGAGATCCAGTCTGTCTACATCCGAGAGGGCATGGGCCAGTTG 

GTGGCTGCTAACGATGGGTGA 



mDoll-2 protein sequence 

MAASAPPPPDKLEGGSGPAPPPAPPSTGRKQGKAGLQMKSPEKKRRKSNT 
QGPAYSHLTEFAPPPTPMVDHLVASNPFEDDFGAPKVGGAGPPFLGSPVP 
FGGFRVQGGMAGQVPPSYGTGGGGGPQPLRRQPPPFPPSPMGPAFNMPPQ 
GPWGTPPPGNMNFPSQPFNQSLGQNFSPPGGQVMPGPVGGFGPMISPTMG 
QPPRGELGPPPLPQRFTQPGAPYGPSLQRPGQGLTQLPSNTSPFPGPDPG 
FPGPGGEDGGKPLNPPAPTAFPQEAPFGLPAAAVNGNQPSFPPSSSGRGG 
GTPDANSLAPPGKAGGGSGPQPPPGLVYPCGACRSEVNDDQDAILCEASC 
QKWFHRECTGMTESAYGLLTTEASAVWACDLCLKTKEIQSVYIREGMGQL 
VAANDG 

SEQUENCE LISTING PART 
<110> The Genetics Company 

<120> A new essential downstream component of the wingless signalling pathway 
<130> P133559 
<160> 10 

<170> PatentIn version 3.1 

<210> 1 

<211> 2448 

<212> DNA 

<213> Drosophila melanogaster 

<400> 1 



atgacccaca atcttggtat ggcgccatat cgattgccgg gtccagcggg cggactctgt 60 
ccgcccgatt ttaagccgcc gcctcccacg gacatcatct cggcgccgag caatccgaag 120 
aagcggcgaa aaacctcaag tgccgccaac tccgctgcag cggtggctgc ggcggcggct 180 
gcagcagctg ctgcgaattc catgcagcag cagcaggcgc cacccacacc gcaggatttg 240 
ctgccccctc cgccaatggg aggcttcgga gacaccatta ttgcctcgaa tccattcgac 300 
gacagtcccc aggtgtcggc gatgtccagc tcagcggccg cggcgatggc ggccatgaat 360 
cagatgggcg gcggaccagg aggtggtcac tttggcggcg gtggaccggg tgggcacccg 420 
cactgggaag accgcatggg catgggcggt ggacctcctc ccccgcctca catgcatccc 480 
catatgcacc cgcatcatcc aggcggacct atgggtcacc cacatggccc acatccgcac 540 
atgggtggtc cacctccaat gcgaggaatg agccccatgc acccccatca aatgggaccg 600 
ggaccaggcg tcggactacc gccgcatatg aatcacggaa ggccaggggg acctggtggt 660 
cctggaggac ccgtcccaat gggtagtccc atgggtggaa tagctggcat gggcggcatg 720 
agcccaatgg gcggaatggg aggccccagc atatcacccc atcacatggg catgggtggt 780 
ctgtcgccca tgggaggcgg tcccaacgga cccaatccgc gagccatgca gggttcaccg 840 
atgggcggtc cggggcagaa ctcgccaatg aactcactgc ctatgggttc gccaatgggc 900 
aatccaattg gcagcccgtt gggccctccc tcgggaccgg gccctgggaa tcccggcaat 960 
accggcggac cacagcagca acaacaacaa cctccgcagc caccgatgaa caacgggcag 1020 
atgggtcctc ctcctctgca cagtccgctc ggaaacggac caacgggtca tggcagtcac 1080 
atgcctggag gaccaatccc aggaccaggt cctgggcctg gcggcctagt aggtcccggt 1140 
ggcatctccc ccgcgcacgg caataacccg ggtggttctg ggaacaacat gctcggcggg 1200 
aatcccggcg gcggcaacag caacaacaac ggaagcaata caagtaacgc cagcaacaac 1260 
aatcaaaatc ctcacctctc gccagcagcc ggacgcctgg gagtgccgac gtcgatgcag 1320 
tcgaatggac cttcggtatc atcggtagcc tcctcatcgg ttccctcgcc cgccacgccc 1380 
acgctcacgc ccacatcgac ggccacgtcc atgtccacgt cagtgcctac atcctcgcca 1440 
gcgccgcccg ccatgtcacc gcatcactcg ctaaacagcg ccggcccgag tccgggcatg 1500 
cccaactcgg gacccagccc gctgcagtca ccagccggac ccaatggccc caataacaac 1560 
aacagcaata acaacaacgg acccatgatg ggccagatga tcccgaacgc agttcctatg 1620 
cagcaccagc agcacatggg cggcggccca cctggccacg ggcccggacc aatgcccgga 1680 
atgggcatga accaaatgct gccaccgcag caaccctccc atcttggtcc cccgcatccg 1740 
aatatgatga accacccgca tcatccgcac catcatcctg gcggaccacc gccgcacatg 1800 
atgggtggac ccggaatgca cggcggtcct gctggaatgc ctcctcatat gggcggagga 1860 
cctaatccgc acatgatggg cggtccgcac gggaacgcgg gtccgcacat gggccacggc 1920 
cacatgggtg gagtaccagg tccaggaccc ggacccggcg gcatgaacgg acccccgcat 1980 
ccgcacatgt ccccgcacca cggacatccg catcaccacc acaatccgat gggcggccca 2040 
ggtccaaata tgttcggcgg tggtggagga ggtcccatgg gtcccggtgg accgatgggc 2100 
aacatggggc ccatgggagg tggcccgatg ggcggcccta tgggcgtagg tcccaagccg 2160 
atgacaatgg gcggcgggaa gatgtacccg ccgggacagc caatggtctt taatccgcag 2220 
aacccgaatg cgccgcccat atatccttgt ggcatgtgcc acaaggaggt gaacgacaac 2280 
gacgaagccg tgttctgtga atccggttgt aactttttct ttcacagaac ctgtgttggc 2340 
ctgacagagg cggccttcca aatgctcaac aaggaggtgt ttgccgagtg gtgctgcgac 2400 
aagtgcgtgt cttccaagca tattcccatg gtcaagttca agtgttga 2448 



<210> 2 

<211> 1366 

<212> DNA 

<213> Human 

<400> 2 

ggatccccac atgcccgccg agaactctcc agctcccgct tacaaagttt cctcgcatgg 60 
tggtgatagt ggactggatg ggttaggagg accaggtgta caactaggaa gcccagataa 120 
gaaaaagcgc aaggcaaata cacagggacc ttctttccct ccattgtctg agtatgctcc 180 
accaccgaat ccaaactctg accatctagt ggctgctaat ccatttgatg acaactataa 240 
tactatttcc tataaaccac taccttcgtc aaatccatat cttggccctg gttatcctgg 300 
ctttggaggc tatagtacat tcagaatgcc acctcacgtt cccccaagaa tgtcttcccc 360 
atactgtggt ccttactcac tcaggaacca gccacaccca tttcctcaga atcctctggg 420 
catgggtttt aatcgacctc atgcttttaa ctttgggcca catgataatt caagtttcgg 480 
taatccatct tataataatg cactaagtca gaatgtcaac atgcctaatc aacattttag 540 
acaaaatcct gctgaaaatt tcagtcaaat tcctccacag aatgctagcc aagtttctaa 600 
ccccgatttg gcatctaatt ttgttcctgg aaataattca aattttactt ctccgttaga 660 
atctaatcat tcttttattc ctcccccaaa cacttttggt caagcaaaag caccaccccc 720 
aaaacaagac tttactcaag gagcaaccaa aaacactaat caaaattcct ctgctcatcc 780 
acctcacttg aatatggatg acacagtgaa tcagagtaat attgaattaa aaaatgttaa 840 
tcgaaacaat gcagtaaatc aggagaacag ccgttcaagt agcactgaag ccacaaacaa 900 
taaccctgca aatgggacgc agaataagcc acgacaacca agaggtgcag cagatgcctg 960 
caccacagaa aaaagcaata aatcctctct tcacccaaac cgtcatggcc attcgtcttc 1020 
tgacccagtg tatccttgtg gaatttgtac aaacgaggtg aacgatgatc aggatgccat 1080 
cttatgtgag gcctcttgtc agaaatggtt tcatcggatc tgtactggaa tgactgaaac 1140 
agcttatggc ctcttaactg cagaagcatc tgcagtatgg ggctgtgata cctgtatggc 1200 
tgacaaagat gtccagttaa tgcgtactag agaaactttt ggtccatctg cagtgggcag 1260 
tgatgcttaa tcaaaggcat taactaaagt gggtttattt tcctgtgcat tgcagaagtt 1320 
cattgacaca ggattttaat gttttacatt atttttttaa atgcat 1366 
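
Rows of the sequence listing carry ten-base groups followed by a running position count at the end of the row. As a minimal sketch, assuming exactly that layout (the function name `join_listing_rows` is hypothetical and introduced only for illustration), the rows can be rejoined into one string while each position count is checked against the number of bases read so far:

```python
def join_listing_rows(rows):
    """Join listing rows into one lowercase sequence string.

    A trailing integer on a row is treated as the running position count
    and must equal the number of bases read up to the end of that row.
    """
    parts = []
    total = 0
    for row in rows:
        tokens = row.split()
        if not tokens:
            continue  # skip blank rows
        expected = None
        if tokens[-1].isdigit():
            expected = int(tokens.pop())
        for group in tokens:
            parts.append(group.lower())
            total += len(group)
        if expected is not None and total != expected:
            raise ValueError(
                f"row ends at base {total}, listing says {expected}"
            )
    return "".join(parts)


# First two rows of the SEQ ID NO 2 entry.
ROWS = [
    "ggatccccac atgcccgccg agaactctcc agctcccgct tacaaagttt cctcgcatgg 60",
    "tggtgatagt ggactggatg ggttaggagg accaggtgta caactaggaa gcccagataa 120",
]

if __name__ == "__main__":
    sequence = join_listing_rows(ROWS)
    print(len(sequence), sequence[:20])
```

A mismatch between the bases read and the printed count raises an error, which is a cheap way to spot dropped or duplicated groups in a scanned listing.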

<210> 3 
<211> 1449 

<212> DNA 
<213> Human 

<400> 3 

cccgggtccc ccactccatg gccgcctcgg cgccgccccc accggacaag ctggagggag 60 

gtggcggccc cgcaccgccc cctgcgccgc ccagcaccgg gaggaagcag ggcaaggccg 120 

gtctgcaaat gaagagtcca gaaaagaagc gaaggaagtc aaatactcag ggccctgcat 180 

actcacatct gacggagttt gcaccacccc caactcccat ggtggatcac ctggttgcat 240 

ccaacccttt tgaagatgac ttcggagccc ccaaagtggg ggttgcagcc cctccattcc 300 

ttggcagtcc tgtgcccttc ggaggcttcc gtgtgcaggg gggcatggcg ggccaggtac 360 

ccccaggcta cagcactgga ggtggagggg gcccccagcc actccgtcga cagccacccc 420 

ccttccctcc caatcctatg ggccctgctt tcaacatgcc cccccagggt cctggctacc 480 

cacccccagg caacatgaac tttcccagcc aacccttcaa ccagcctctg ggtcaaaact 540 

ttagtcctcc cagtgggcag atgatgccgg gcccagtggg gggatttggt cccatgatct 600 

cacccaccat gggacagcct cccagagcag agctgggccc accttctctg tcccaacgat 660 

ttgctcagcc aggggctcct tttggccctt ctcctctcca gagacctggt caggggctcc 720 

ccagcctgcc gcctaacaca agtccctttc ctggtccgga ccctggcttt cctggccctg 780 

gtggtgagga tggggggaag cccttgaatc cacctgcttc tactgctttt ccccaggagc 840 

cccactcagg ctccccggct gctgctgtta atgggaacca gcccagtttc cccccgaaca 900 



WO 02/077023 

gcagtgggcg gggtgggggc actccagatg ccaacagctt ggcaccccct ggcaaggcag 960 

gtgggggctc cgggccccag cctcccccag gcttggtgta cccatgtggt gcctgtcgga 1020 

gtgaggtgaa cgatgaccag gatgccattc tgtgtgaggc ctcctgccag aaatggttcc 1080 

accgtgagtg cacaggcatg actgagagcg cctatgggct gctgaccact gaagcttctg 1140 

ccgtctgggc ctgcgatctc tgcctcaaga ccaaggagat ccagtctgtc tacatccgtg 1200 

agggcatggg gcagctggtg gctgctaacg atgggtgacg ctggtgaagt ggcccaggga 1260 

agtgcacatg tctctccctg ctcttccagg gtgatttttt tgatgtttgg ctcttggtcc 1320 

ttgtttccac tggctttcca tccccatggg gcagaaacag tggctcctgg gagcagaaaa 1380 

ggaattgagg tgggcaggca gaagagcctg gattgctcac tgttttggga aacttacatg 1440 

ttgagatct 1449 



<210> 4 
<211> 1254 
<212> DNA 
<213> Mouse 








<400> 4 
atgtcagcgg aacaggacaa ggagcccatc gcgctgaaga gagttagagg tggtgacagt 60 
ggactggatg ggttaggagg gcccaatata caactcggaa gcccagataa gaaaaaacgc 120 
aaggccaaca cacagggatc ttcctttcct ccgctgtcag agtacgcgcc accaccgaat 180 
ccaaactccg accatctagt ggctgccaat ccgttcgatg acagctacaa caccatttcc 240 
tataaaccac tgccttcatc taatccatat cttggccctg ggtatcctgg ctttggaggc 300 
tacagcacat tcagaatgcc accccacgtc cctccaagaa tgtcttctcc ctactgtggt 360 
ccttactcac tcaggaatca gccccaccca tttcctcaga atccgttggg catgggcttt 420 
aaccggcctc atgcttttaa ctttgggccg catgataatt cgaattttgg aaacccacct 480 
tataataatg tactgactca ggacattaac atgcccggtc agcattttag acaaggctct 540 
gctgaaaact tcagtcagat tcccccgcag aatgttggcc aagtgtctaa ccctgacctc 600 
gcatctaatt ttgcccctgg gaataattca aattttacct ctccgttaga aacgaatcat 660 
tcgtttattc cacccccaaa cgcgtttggc caagcaaaag ctccacttcc caaacaagac 720 
ttcactcaag gggcaaccaa aaccccgaat cagaattcgt ccactcaccc acctcaccta 780 
aatatggagg atccagtcaa tcagagtaac gtcgagttaa aaaatgtcaa cagaaacaac 840 
gttgtccaag agaacagccg ttcgggcagc gcagaggcca ccaacaacca tgcgaatggg 900 
acccagaaca agccccggca gcccaggggc gcagctgacc tgtgcacccc cgacaaaagc 960 
cgcaagttct ccctgctccc cagccggcat ggccattcct cctctgaccc tgtgtacccg 1020 
tgcgggattt gtacaaatga agtgaatgac gatcaggacg ccattctgtg tgaagcctct 1080 
tgtcagaagt ggtttcatcg catctgcact ggaatgaccg aaacagccta cgggctcctg 1140 
acagcggaag catccgcagt gtggggctgt gacacgtgca tggctgacaa ggatgtccag 1200 
ctcatgcgca ctagagaggc ctttggtcca cctgccgtgg gcggcgatgc ctaa 1254 



<210> 5 

<211> 1221 

<212> DNA 

<213> Mouse 

<400> 5 

atggccgcct cggcgccgcc cccaccggac aagctggagg gaggcagcgg ccccgcaccg 60 
ccccccgcgc cgcccagcac cgggaggaag cagggcaagg ccggtctgca aatgaagagc 120 
cctgaaaaga agcgaagaaa gtccaatact cagggtcctg catattcaca tctgacggag 180 
ttcgccccac ccccgacccc catggtggat catctggtcg cttctaaccc ttttgaggat 240 
gatttcggag cccctaaagt ggggggcgca ggccctccgt tcctcggcag tccggtgccc 300 
tttggaggct ttcgtgtaca ggggggcatg gcaggccagg tacccccaag ctacggcact 360 
ggaggaggag ggggtcccca gccacttcgt cggcagcccc ctccttttcc ccccagccct 420 
atgggtccag cttttaatat gccccctcag ggtccctggg gtaccccgcc ccctggcaac 480 
atgaactttc ccagtcaacc cttcaaccag tctctgggcc aaaactttag cccacctggt 540 
gggcaggtga tgccaggccc agtaggcgga tttggtccca tgatctcacc gaccatggga 600 
cagcctccta gaggggagct gggtcctcct cctctccccc aacgctttac ccaaccagga 660 
gcaccttatg gtccttctct tcagagacct ggtcagggac tcacccagct gccctccaac 720 
acaagtccct tccctggtcc agaccctggt tttcctggac ctggcggtga ggatggtggg 780 
aagcccttga acccaccggc tcccaccgcc tttccccagg aagcaccatt cgggctcccc 840 
gctgctgctg tcaatgggaa tcagcccagt ttccccccta gcagcagtgg tcgaggtggg 900 
ggcactccag atgccaacag tctggcaccc cccggcaagg cagggggagg ctcagggccc 960 
cagcctcccc caggcctggt gtacccctgc ggtgcctgcc gtagtgaggt aaatgatgac 1020 
caggatgcca ttctgtgtga ggcctcctgc cagaagtggt ttcaccgcga gtgcaccggc 1080 
atgaccgaga gtgcctacgg cctgctgacc accgaggcct ctgccgtctg ggcctgtgat 1140 
ctttgcctca agaccaagga gatccagtct gtctacatcc gagagggcat gggccagttg 1200 
gtggctgcta acgatgggtg a 1221 



<210> 6 

<211> 815 

<212> PRT 

<213> Drosophila melanogaster 



<400> 
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Met Thr His Asn Leu Gly Met Ala Pro Tyr Arg Leu Pro Gly Pro Ala 
15 10 15 



Gly Gly Leu Cys Pro Pro Asp Phe Lys Pro Pro Pro Pro Thr Asp lie 
20 25 30 



lie Ser Ala Pro Ser Asn Pro Lys Lys Arg Arg Lys Thr Ser Ser Ala 
35 40 45 



Ala Asn Ser Ala Ala Ala Val Ala Ala Ala Ala Ala Ala Ala Ala Ala 
50 55 60 



Ala Asn Ser Met Gin Gin Gin Gin Ala Pro Pro Thr Pro Gin Asp Leu 
65 70 75 80 



Leu Pro Pro Pro Pro Met Gly Gly Phe Gly Asp Thr lie lie Ala Ser 
85 90 95 



Asn Pro Phe Asp Asp Ser Pro Gin Val Ser Ala Met Ser Ser Ser Ala 
100 105 110 



Ala Ala Ala Met Ala Ala Met Asn Gin Met Gly Gly Gly Pro Gly Gly 
115 120 125 



Gly His Phe Gly Gly Gly Gly Pro Gly Gly His Pro His Trp Glu Asp 
130 135 140 



Arg Met Gly Met Gly Gly Gly Pro Pro Pro Pro Pro His Met His Pro 
145 150 155 160 



His Met His Pro His His Pro Gly Gly Pro Met Gly His Pro His Gly 
165 170 175 



Pro His Pro His Met Gly Gly Pro Pro Pro Met Arg Gly Met Ser Pro 
180 185 190 



Met His Pro His Gin Met Gly Pro Gly Pro Gly Val Gly Leu Pro Pro 
195 200 205 



His Met Asn His Gly Arg Pro Gly Gly Pro Gly Gly Pro Gly Gly Pro 
210 215 220 



Val Pro Met Gly Ser Pro Met Gly Gly lie Ala Gly Met Gly Gly Met 
225 230 235 240 



Ser Pro Met Gly Gly Met Gly Gly Pro Ser lie Ser Pro His His Met 
245 250 . 255 
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Gly Met Gly Gly Leu Ser Pro Met Gly Gly Gly Pro Asn Gly Pro Asn 
260 265 270 



Pro Arg Ala Met Gin Gly Ser Pro Met Gly Gly Pro Gly Gin Asn Ser 
275 280 285 



Pro Met Asn Ser Leu Pro Met Gly Ser Pro Met Gly Asn Pro lie Gly 
290 295 300 



Ser Pro Leu Gly Pro Pro Ser Gly Pro Gly Pro Gly Asn Pro Gly Asn 
305 310 315 320 



Thr Gly Gly Pro Gin Gin Gin Gin Gin Gin Pro Pro Gin Pro Pro Met 
325 330 335 



Asn Asn Gly Gin Met Gly Pro Pro Pro Leu His Ser Pro Leu Gly Asn 
340 345 350 



Gly Pro Thr Gly His Gly Ser His Met Pro Gly Gly Pro lie Pro Gly 
355 360 365 



Pro Gly Pro Gly Pro Gly Gly Leu Val Gly Pro Gly Gly lie < Ser Pro 
370 375 380 



Ala His Gly Asn Asn Pro Gly Gly Ser Gly Asn Asn Met Leu Gly Gly 
385 390 395 400 



Asn Pro Gly Gly Gly Asn Ser Asn Asn Asn Gly Ser Asn Thr Ser Asn 
405 410 415 



Ala Ser Asn Asn Asn Gin Asn Pro His Leu Ser Pro Ala Ala Gly Arg 
420 425 430 



Leu Gly Val Pro Thr Ser Met Gin Ser Asn Gly Pro Ser Val Ser Ser 
» 435 440 445 



Val Ala Ser Ser Ser Val Pro Ser Pro Ala Thr Pro Thr Leu Thr Pro 
450 455 460 



Thr Ser Thr Ala Thr Ser Met Ser Thr Ser Val Pro Thr Ser Ser Pro 
465 470 475 480 



Ala Pro Pro Ala Met Ser Pro His His Ser Leu Asn Ser Ala Gly Pro 
485 490 495 



Ser Pro Gly Met Pro Asn Ser Gly Pro Ser Pro Leu Gln Ser Pro Ala
500 505 510



Gly Pro Asn Gly Pro Asn Asn Asn Asn Ser Asn Asn Asn Asn Gly Pro 
515 520 525 



Met Met Gly Gln Met Ile Pro Asn Ala Val Pro Met Gln His Gln Gln
530 535 540 



His Met Gly Gly Gly Pro Pro Gly His Gly Pro Gly Pro Met Pro Gly 
545 550 555 560 



Met Gly Met Asn Gln Met Leu Pro Pro Gln Gln Pro Ser His Leu Gly
565 570 575 



Pro Pro His Pro Asn Met Met Asn His Pro His His Pro His His His 
580 585 590 



Pro Gly Gly Pro Pro Pro His Met Met Gly Gly Pro Gly Met His Gly 
595 600 605 



Gly Pro Ala Gly Met Pro Pro His Met Gly Gly Gly Pro Asn Pro His 
610 615 620 



Met Met Gly Gly Pro His Gly Asn Ala Gly Pro His Met Gly His Gly 
625 630 635 640 



His Met Gly Gly Val Pro Gly Pro Gly Pro Gly Pro Gly Gly Met Asn 
645 650 655 



Gly Pro Pro His Pro His Met Ser Pro His His Gly His Pro His His 
660 665 670 



His His Asn Pro Met Gly Gly Pro Gly Pro Asn Met Phe Gly Gly Gly 
675 680 685 



Gly Gly Gly Pro Met Gly Pro Gly Gly Pro Met Gly Asn Met Gly Pro 
690 695 700 



Met Gly Gly Gly Pro Met Gly Gly Pro Met Gly Val Gly Pro Lys Pro 
705 710 715 720



Met Thr Met Gly Gly Gly Lys Met Tyr Pro Pro Gly Gln Pro Met Val
725 730 735 



Phe Asn Pro Gln Asn Pro Asn Ala Pro Pro Ile Tyr Pro Cys Gly Met
740 745 750 



Cys His Lys Glu Val Asn Asp Asn Asp Glu Ala Val Phe Cys Glu Ser
755 760 765 



Gly Cys Asn Phe Phe Phe His Arg Thr Cys Val Gly Leu Thr Glu Ala 
770 775 780 



Ala Phe Gln Met Leu Asn Lys Glu Val Phe Ala Glu Trp Cys Cys Asp
785 790 795 800 

Lys Cys Val Ser Ser Lys His Ile Pro Met Val Lys Phe Lys Cys
805 810 815 



<210> 7 

<211> 419 

<212> PRT 

<213> Human 

<400> 7 

Met Pro Ala Glu Asn Ser Pro Ala Pro Ala Tyr Lys Val Ser Ser His 
1 5 10 15

Gly Gly Asp Ser Gly Leu Asp Gly Leu Gly Gly Pro Gly Val Gln Leu
20 25 30 



Gly Ser Pro Asp Lys Lys Lys Arg Lys Ala Asn Thr Gln Gly Pro Ser
35 40 45



Phe Pro Pro Leu Ser Glu Tyr Ala Pro Pro Pro Asn Pro Asn Ser Asp 
50 55 60 



His Leu Val Ala Ala Asn Pro Phe Asp Asp Asn Tyr Asn Thr Ile Ser
65 70 75 80



Tyr Lys Pro Leu Pro Ser Ser Asn Pro Tyr Leu Gly Pro Gly Tyr Pro 
85 90 95 



Gly Phe Gly Gly Tyr Ser Thr Phe Arg Met Pro Pro His Val Pro Pro 
100 105 110 



Arg Met Ser Ser Pro Tyr Cys Gly Pro Tyr Ser Leu Arg Asn Gln Pro
115 120 125 



His Pro Phe Pro Gln Asn Pro Leu Gly Met Gly Phe Asn Arg Pro His
130 135 140 



Ala Phe Asn Phe Gly Pro His Asp Asn Ser Ser Phe Gly Asn Pro Ser 
145 150 155 160

Tyr Asn Asn Ala Leu Ser Gln Asn Val Asn Met Pro Asn Gln His Phe
165 170 175 



Arg Gln Asn Pro Ala Glu Asn Phe Ser Gln Ile Pro Pro Gln Asn Ala
180 185 190 



Ser Gln Val Ser Asn Pro Asp Leu Ala Ser Asn Phe Val Pro Gly Asn
195 200 205 



Asn Ser Asn Phe Thr Ser Pro Leu Glu Ser Asn His Ser Phe Ile Pro
210 215 220 



Pro Pro Asn Thr Phe Gly Gln Ala Lys Ala Pro Pro Pro Lys Gln Asp
225 230 235 240 



Phe Thr Gln Gly Ala Thr Lys Asn Thr Asn Gln Asn Ser Ser Ala His
245 250 255 



Pro Pro His Leu Asn Met Asp Asp Thr Val Asn Gln Ser Asn Ile Glu
260 265 270 



Leu Lys Asn Val Asn Arg Asn Asn Ala Val Asn Gln Glu Asn Ser Arg
275 280 285 



Ser Ser Ser Thr Glu Ala Thr Asn Asn Asn Pro Ala Asn Gly Thr Gln
290 295 300 



Asn Lys Pro Arg Gln Pro Arg Gly Ala Ala Asp Ala Cys Thr Thr Glu
305 310 315 320 



Lys Ser Asn Lys Ser Ser Leu His Pro Asn Arg His Gly His Ser Ser 
325 330 335 



Ser Asp Pro Val Tyr Pro Cys Gly Ile Cys Thr Asn Glu Val Asn Asp
340 345 350 



Asp Gln Asp Ala Ile Leu Cys Glu Ala Ser Cys Gln Lys Trp Phe His
355 360 365 



Arg Ile Cys Thr Gly Met Thr Glu Thr Ala Tyr Gly Leu Leu Thr Ala
370 375 380 



Glu Ala Ser Ala Val Trp Gly Cys Asp Thr Cys Met Ala Asp Lys Asp 
385 390 395 400 



Val Gln Leu Met Arg Thr Arg Glu Thr Phe Gly Pro Ser Ala Val Gly
405 410 415

Ser Asp Ala



<210> 8

<211> 406 

<212> PRT 

<213> Human 

<400> 8 



Met Ala Ala Ser Ala Pro Pro Pro Pro Asp Lys Leu Glu Gly Gly Gly 
1 5 10 15 



Gly Pro Ala Pro Pro Pro Ala Pro Pro Ser Thr Gly Arg Lys Gln Gly
20 25 30 



Lys Ala Gly Leu Gln Met Lys Ser Pro Glu Lys Lys Arg Arg Lys Ser
35 40 45 



Asn Thr Gln Gly Pro Ala Tyr Ser His Leu Thr Glu Phe Ala Pro Pro
50 55 60 



Pro Thr Pro Met Val Asp His Leu Val Ala Ser Asn Pro Phe Glu Asp 
65 70 75 80



Asp Phe Gly Ala Pro Lys Val Gly Val Ala Ala Pro Pro Phe Leu Gly
85 90 95



Ser Pro Val Pro Phe Gly Gly Phe Arg Val Gln Gly Gly Met Ala Gly
100 105 110 



Gln Val Pro Pro Gly Tyr Ser Thr Gly Gly Gly Gly Gly Pro Gln Pro
115 120 125 



Leu Arg Arg Gln Pro Pro Pro Phe Pro Pro Asn Pro Met Gly Pro Ala
130 135 140 



Phe Asn Met Pro Pro Gln Gly Pro Gly Tyr Pro Pro Pro Gly Asn Met
145 150 155 160 



Asn Phe Pro Ser Gln Pro Phe Asn Gln Pro Leu Gly Gln Asn Phe Ser
165 170 175 



Pro Pro Ser Gly Gln Met Met Pro Gly Pro Val Gly Gly Phe Gly Pro
180 185 190 



Met Ile Ser Pro Thr Met Gly Gln Pro Pro Arg Ala Glu Leu Gly Pro
195 200 205 




Pro Ser Leu Ser Gln Arg Phe Ala Gln Pro Gly Ala Pro Phe Gly Pro
210 215 220 



Ser Pro Leu Gln Arg Pro Gly Gln Gly Leu Pro Ser Leu Pro Pro Asn

225 230 235 240 

Thr Ser Pro Phe Pro Gly Pro Asp Pro Gly Phe Pro Gly Pro Gly Gly 

245 250 255 



Glu Asp Gly Gly Lys Pro Leu Asn Pro Pro Ala Ser Thr Ala Phe Pro 
260 265 270 



Gln Glu Pro His Ser Gly Ser Pro Ala Ala Ala Val Asn Gly Asn Gln
275 280 285 



Pro Ser Phe Pro Pro Asn Ser Ser Gly Arg Gly Gly Gly Thr Pro Asp 
290 295 300 



Ala Asn Ser Leu Ala Pro Pro Gly Lys Ala Gly Gly Gly Ser Gly Pro 
305 310 315 320 



Gln Pro Pro Pro Gly Leu Val Tyr Pro Cys Gly Ala Cys Arg Ser Glu
325 330 335 



Val Asn Asp Asp Gln Asp Ala Ile Leu Cys Glu Ala Ser Cys Gln Lys
340 345 350 



Trp Phe His Arg Glu Cys Thr Gly Met Thr Glu Ser Ala Tyr Gly Leu 
355 360 365 



Leu Thr Thr Glu Ala Ser Ala Val Trp Ala Cys Asp Leu Cys Leu Lys 
370 375 380 



Thr Lys Glu Ile Gln Ser Val Tyr Ile Arg Glu Gly Met Gly Gln Leu
385 390 395 400 



Val Ala Ala Asn Asp Gly 
405 



<210> 9 

<211> 417 

<212> PRT 

<213> Mouse 

<400> 9 



Met Ser Ala Glu Gln Asp Lys Glu Pro Ile Ala Leu Lys Arg Val Arg
1 5 10 15

Gly Gly Asp Ser Gly Leu Asp Gly Leu Gly Gly Pro Asn Ile Gln Leu
20 25 30 



Gly Ser Pro Asp Lys Lys Lys Arg Lys Ala Asn Thr Gln Gly Ser Ser
35 40 45



Phe Pro Pro Leu Ser Glu Tyr Ala Pro Pro Pro Asn Pro Asn Ser Asp 
50 55 60 

His Leu Val Ala Ala Asn Pro Phe Asp Asp Ser Tyr Asn Thr Ile Ser
65 70 75 80 

Tyr Lys Pro Leu Pro Ser Ser Asn Pro Tyr Leu Gly Pro Gly Tyr Pro 
85 90 95 



Gly Phe Gly Gly Tyr Ser Thr Phe Arg Met Pro Pro His Val Pro Pro 
100 105 110 

Arg Met Ser Ser Pro Tyr Cys Gly Pro Tyr Ser Leu Arg Asn Gln Pro
115 120 125 

His Pro Phe Pro Gln Asn Pro Leu Gly Met Gly Phe Asn Arg Pro His
130 135 140

Ala Phe Asn Phe Gly Pro His Asp Asn Ser Asn Phe Gly Asn Pro Pro
145 150 155 160 

Tyr Asn Asn Val Leu Thr Gln Asp Ile Asn Met Pro Gly Gln His Phe
165 170 175



Arg Gln Gly Ser Ala Glu Asn Phe Ser Gln Ile Pro Pro Gln Asn Val
180 185 190 



Gly Gln Val Ser Asn Pro Asp Leu Ala Ser Asn Phe Ala Pro Gly Asn
195 200 205 



Asn Ser Asn Phe Thr Ser Pro Leu Glu Thr Asn His Ser Phe Ile Pro
210 215 220 

Pro Pro Asn Ala Phe Gly Gln Ala Lys Ala Pro Leu Pro Lys Gln Asp
225 230 235 240 

Phe Thr Gln Gly Ala Thr Lys Thr Pro Asn Gln Asn Ser Ser Thr His
245 250 255 



Pro Pro His Leu Asn Met Glu Asp Pro Val Asn Gln Ser Asn Val Glu
260 265 270

Leu Lys Asn Val Asn Arg Asn Asn Val Val Gln Glu Asn Ser Arg Ser
275 280 285 



Gly Ser Ala Glu Ala Thr Asn Asn His Ala Asn Gly Thr Gln Asn Lys
290 295 300 



Pro Arg Gln Pro Arg Gly Ala Ala Asp Leu Cys Thr Pro Asp Lys Ser
305 310 315 320 



Arg Lys Phe Ser Leu Leu Pro Ser Arg His Gly His Ser Ser Ser Asp 
325 330 335 



Pro Val Tyr Pro Cys Gly Ile Cys Thr Asn Glu Val Asn Asp Asp Gln
340 345 350 



Asp Ala Ile Leu Cys Glu Ala Ser Cys Gln Lys Trp Phe His Arg Ile
355 360 365 



Cys Thr Gly Met Thr Glu Thr Ala Tyr Gly Leu Leu Thr Ala Glu Ala 
370 375 380 



Ser Ala Val Trp Gly Cys Asp Thr Cys Met Ala Asp Lys Asp Val Gln
385 390 395 400 



Leu Met Arg Thr Arg Glu Ala Phe Gly Pro Pro Ala Val Gly Gly Asp 
405 410 415 



Ala 



<210> 10 

<211> 406 

<212> PRT 

<213> Mouse 

<400> 10 

Met Ala Ala Ser Ala Pro Pro Pro Pro Asp Lys Leu Glu Gly Gly Ser 
1 5 10 15



Gly Pro Ala Pro Pro Pro Ala Pro Pro Ser Thr Gly Arg Lys Gln Gly
20 25 30 



Lys Ala Gly Leu Gln Met Lys Ser Pro Glu Lys Lys Arg Arg Lys Ser
35 40 45 



Asn Thr Gln Gly Pro Ala Tyr Ser His Leu Thr Glu Phe Ala Pro Pro
50 55 60

Pro Thr Pro Met Val Asp His Leu Val Ala Ser Asn Pro Phe Glu Asp
65 70 75 80 

Asp Phe Gly Ala Pro Lys Val Gly Gly Ala Gly Pro Pro Phe Leu Gly 
85 90 95 



Ser Pro Val Pro Phe Gly Gly Phe Arg Val Gln Gly Gly Met Ala Gly
100 105 110



Gln Val Pro Pro Ser Tyr Gly Thr Gly Gly Gly Gly Gly Pro Gln Pro
115 120 125 



Leu Arg Arg Gln Pro Pro Pro Phe Pro Pro Ser Pro Met Gly Pro Ala
130 135 140 



Phe Asn Met Pro Pro Gln Gly Pro Trp Gly Thr Pro Pro Pro Gly Asn
145 150 155 160 



Met Asn Phe Pro Ser Gln Pro Phe Asn Gln Ser Leu Gly Gln Asn Phe
165 170 175 



Ser Pro Pro Gly Gly Gln Val Met Pro Gly Pro Val Gly Gly Phe Gly
180 185 190 



Pro Met Ile Ser Pro Thr Met Gly Gln Pro Pro Arg Gly Glu Leu Gly
195 200 205 



Pro Pro Pro Leu Pro Gln Arg Phe Thr Gln Pro Gly Ala Pro Tyr Gly
210 215 220 



Pro Ser Leu Gln Arg Pro Gly Gln Gly Leu Thr Gln Leu Pro Ser Asn
225 230 235 240



Thr Ser Pro Phe Pro Gly Pro Asp Pro Gly Phe Pro Gly Pro Gly Gly 
245 250 255 



Glu Asp Gly Gly Lys Pro Leu Asn Pro Pro Ala Pro Thr Ala Phe Pro 
260 265 270 



Gln Glu Ala Pro Phe Gly Leu Pro Ala Ala Ala Val Asn Gly Asn Gln
275 280 285 



Pro Ser Phe Pro Pro Ser Ser Ser Gly Arg Gly Gly Gly Thr Pro Asp 
290 295 300 

Ala Asn Ser Leu Ala Pro Pro Gly Lys Ala Gly Gly Gly Ser Gly Pro
305 310 315 320



Gln Pro Pro Pro Gly Leu Val Tyr Pro Cys Gly Ala Cys Arg Ser Glu
325 330 335 



Val Asn Asp Asp Gln Asp Ala Ile Leu Cys Glu Ala Ser Cys Gln Lys
340 345 350 



Trp Phe His Arg Glu Cys Thr Gly Met Thr Glu Ser Ala Tyr Gly Leu 
355 360 365 



Leu Thr Thr Glu Ala Ser Ala Val Trp Ala Cys Asp Leu Cys Leu Lys 
370 375 380 



Thr Lys Glu Ile Gln Ser Val Tyr Ile Arg Glu Gly Met Gly Gln Leu
385 390 395 400 



Val Ala Ala Asn Asp Gly 
405

preliminary examination on matter which has not been searched. This is